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NOVEL COMPOUNDS 

Field nf the Invention: 

This invention relates to newly identified polynucleotides, polypeptides encoded by 
such polynucleotides, the uses of such polynucleotides and polypeptides, as well as the 

5 production of such polynucleotides and polypeptides and recombinant host cells 
transformed with the polynucleotides. This invention also relates to inhibiting the 
biosynthesis or action of such polynucleotides or polypeptides and to the use of such 
inhibitors in therapy. Preferred embodiments of the invention include Streptococcal 
polypeptides and polynucleotides, particularly those of Streptococcus pneumoniae. 

10 Rftckground of the Invention; 

The Streptococci make up a medically important genera of microbes known to 
cause several types of disease in humans, including otitis media, pneumonia and meningitis. 
Since its isolation more than 100 years ago. Streptococcus pneumoniae has been one of the 
more intensively studied microbes. For example, much of our early understanding that 

15 DNA is, in fact, the genetic material was predicated on the work of Griffith and of Avery, 
Macleod and McCarty using this microbe. Despite the vast amount of research with S. 
pneumoniae, many questions concerning the virulence of this microbe remain. 

While certain Streptococcal factors associated with pathogenicity have been 
identified, e.g., capsule polysaccharides, peptidoglycans, pneumolyses, PspA Complement 

20 factor H binding component, autolysin, neuraminidase, peptide permeases, hydrogen 
peroxide, IgAl protease, the list is certainly not complete. Further very little is known 
concerning the temporal expression of such genes during infection and disease progression 
in a mammalian host. Discovering the sets of genes the bacterium is likely to be expressing 
at the different stages of infection, particularly when an infection is established, provides 

25 critical information for the screening and characterization of novel antibacterials which can 
interrupt pathogenesis. In addition to providing a fuller understanding of known proteins, 
such an approach will identify previously unrecognised targets. 
Brief Description of the Invention: 

This invention provides novel proteins, particularly those from Streptococcus 

30 pneumoniae, strain 0100993, characterised in that it comprises the amino acid sequences 
given herein or fragments, analogues or derivatives thereof. 

In accordance with another aspect of the present invention, there are provided 
polynucleotides (DNA or RNA) which encode such polypeptides. 



1 
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In particular the invention provides polynucleotides having the DNA sequences 
given herein. 

The invention also relates to novel oligonucleotides derived from the sequences 
given herein which can act as PCR primers in the process herein described to determine 
5 whether or not the Streptococcus pneumoniae gene identified herein in whole or in part is 
expressed in infected tissue. It is recognised that such sequences will also have utility in 
diagnosis of the stage of infection and type of infection the pathogen has attained. The 
proteins so identified are also useful as targets in screens designed to identify antimicrobial 
compounds. 

10 It is an object of the invention to provide polypeptides that have been identified as novel 

polypeptides by homology between the amino acid sequence set out in the Sequence Listing and 
aknown amino acid sequence or sequences of other proteins such as the proteins identified under 

the heading Identity in Table 1. 

It is a further object of the invention to provide polynucleotides that encode polypeptides 
15 set forth in the Sequence Listing, particularly polynucleotides that encode the polypeptide set 

forth in the Sequence Listing. 

In a particularly preferred embodiment of the invention the polynucleotide comprises a 
region encoding polypeptides comprising the sequence set out in the Sequence Listing, or a 
variant thereof. 

20 In another particularly preferred embodiment of the invention there is a novel 

protein from comprising an amino acid sequence of the Sequence Listing, or a variant 
thereof. 

In accordance with another aspect of the invention, there is provided the use of a 
polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 
25 genetic immunization. Among the particularly preferred embodiments of the invention are 
naturally occurring allelic variants the polynucleotides set forth in the Sequence Lisung and 

polypeptides encoded thereby. 

In accordance with yet another aspect of the invention, there are provided inhibitors 
to such polypeptides, useful as antibacterial agents, including, for example, antibodies. 
30 In accordance with certain preferred embodiments of the invention, there are provided 

products, compositions and methods for assess** expression of the sequences the Sequence 
Lisung treating disease, for example, otitis media, conjunctivitis, pneumonia, bacteremia, 
meningitis, sinusitis, pleural empyema and endocarditis, and most particularly meninges, 
such as for example infection of cerebrospinal fluid, assaying genetic variauon, and 
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administering a polypeptide or polynucleotide of the invention to an organism to raise an 
immunological response against a bacteria, especially a Streptococcus pneumoniae bacteria. 

In accordance with certain preferred embodiments of this and other aspects of the 
invention there are provided polynucleotides that hybridize to polynucleotide sequences of the 
S invention, particularly under stringent conditions. 

In certain preferred embodiments of the invention there are provided antibodies against 
polypeptides of the invention. 

In other embodiments of the invention there are provided methods for identifying 
compounds which bind to or otherwise interact with and inhibit or activate an activity of a 
1 0 polypeptide or polynucleotide of the invention comprising: contacting a polypeptide or 

polynucleotide of the invention with a compound to be screened under conditions to permit 
binding to or other interaction between the compound and the polypeptide or polynucleotide to 
assess the binding to or other interaction with the compound, such binding or interaction being 
associated with a second component capable of providing a detectable signal in response to the 
1 5 binding or interaction of the polypeptide or polynucleotide with the compound; and determining 
whether the compound binds to or otherwise interacts with and activates or inhibits an activity of 
the polypetide or polynucleotide by detecting the presence or absence of a signal generated from 
the binding or interaction of the compound with the polypeptide or polynucleotide. 

In accordance with yet another aspect of the invention* there are provided agonists and 
20 antagonists of the polynucleotide and/or polypeptides of the invention, preferably bacteriostatic 
or bacteriocidal agonists and antagonists. 

In a further aspect of the invention there are provided compositions comprising a 
polynucleotide or a polypeptide of the invention for administration to a cell or to a multicellular 
organism. 

25 Various changes and modifications within the spirit and scope of the disclosed invention 

will become readily apparent to those skilled in the art from reading the following descriptions 
and from reading the other parts of the present disclosure. 
Detailed Description of the Invention; 

Each of the DNA sequences provided herein may be used in the discovery and 

30 development of antibacterial compounds. Because each of the sequences contains an open 
reading frame (ORF) with appropriate initiation and termination codons, the encoded 
protein upon expression can be used as a target for the screening of antimicrobial drugs. 
Additionally, the DNA sequences encoding preferably the ammo terminal regions of the 
encoded protein or the Shine-Delgamo region cr*n be used to construct antisense sequences 
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to control the expression of the coding sequence of interest. Furthermore, many of the 
sequences disclosed herein also provide regions upstream and downstream from the 
encoding sequence. These sequences are useful as a source of regulatory elements for the 
control of bacterial gene expression. Such sequences are conveniently isolated by 
5 restriction enzyme action or synthesized chemically and introduced, for example, into 
promoter identification strains. These strains contain a reporter structural gene sequence 
located downstream from a restriction site such that if an active promoter is inserted, the 
reporter gene will be expressed. 

Although each of the sequences may be employed as described above, this 
10 invention also provides several means for identifying particularly useful target genes. The 
first of these approaches entails searching appropriate databases for sequence matches in 
related organisms. Thus, if a homologue exists, the Streptococcal-like form of this gene 
would likely play an analogous role. For example, a Streptococcal protein identified as 
homologous to a ceil surface protein in another organism would be useful as a vaccine 
15 candidate. To the extent such homologies have been identified for the sequences disclosed 
herein they are reported along with the encoding sequence. 

Recently techniques have become available to evaluate temporal gene expression in 
bacteria, particularly as it applies to viability under laboratory and infection conditions. A 
number of methods can be used to identify genes which are essential to survival per se, or 
20 essential to the establishment/maintenance of an infection. Identification of an ORF 
unknown by one of these methods yields additional information about its function and 
permits the selection of such an ORF for further development as a screening target. Briefly, 
these approaches include: 
1) Signature Tagged Mutagenesis (STM) 
25 This technique is described by Hensel ei al, Sfiknfifi 269: 400-403( 1995), the 

contents of which is incorporated by reference for background purposes. 
Signature tagged mutagenesis identifies genes necessary for the establishment/maintenance 
of infection in a given infection model. 

The basis of the technique is the random mutagenesis of target organism by various 
30 means (e.g.. transposons) such that unique DNA sequence tags are inserted in close 

proximity to the site of mutation. The tags from a mixed population of bacterial mutants 
and bacteria recovered from an infected hosts are detected by amplification, radiolabeling 
and hybridisation analysis. Mutants attenuated in virulence are revealed by absence of the 
tag from the pool of bacteria recovered from infected hosts. 
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In Streptococcus pneumoniae, because the transposon system is less well 
developed, a more efficient way of creating the tagged mutants is to use the insertion- 
duplication mutagenesis technique as described by Morrison et al.. J. Bacteriol. 159:870 
(1984) the contents of which is incorporated by reference for background purposes. 

5 2) In Vivo Expression Technology (IVET) 

This technique is described by Camilli & at, £rc£. Mail. Acad- Sfii USA 91 :2634- 
2638 (1994), the contents of which is incorporated by reference for background purposes. 
IVET identifies genes up-regulated during infection when compared to laboratory 
cultivation, implying an important role in infection. ORF identified by this technique are 

10 implied to have a significant role in infection establishment/maintenance. 

In this technique random chromosomal fragments of target organism are cloned 
upstream of a promoter-less recombinase gene in a plasmid vector. This construct is 
introduced into the target organism which carries an antibiotic resistance gene flanked by 
resolvase sites. Growth in the presence of the antibiotic removes from the population those 

15 fragments cloned into the plasmid vector capable of supporting transcription of the 

recombinase gene and therefore have caused loss of antibiotic resistance. The resistant pool 
is introduced into a host and at various times after infection bacteria may be recovered and 
assessed for the presence of antibiotic resistance. The chromosomal fragment carried by 
each antibiotic sensitive bacterium should carry a promoter or portion of a gene normally 

20 upregulated during infection. Sequencing upstream of the recombinase gene allows 
identification of the up regulated gene. 
3) Differential display 

This technique is described by Chuang eial, J- Bacteriol . 175:2026-2036(1993), 
the contents of which is incorporated by reference for background purposes. This method 

25 identifies those genes which are expressed in an organism by identifying mRNA present 
using randomly-primed RT-PCR. By comparing pre-infection and post infection profiles, 
genes up and down regulated during infection can be identified and the RT-PCR product 
sequenced and matched to ORF 'unknowns'. 



5 
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4) Generation of conditional lethal mutants by transposon mutagenesis. 

This technique, described by de Lorenzo, V. £iaL, Qens 123:17-24 (1993); 
Neuwald, A. F. £l aL, GSD& 125: 69-73(1993); and Takiff, H. E. si aL J Bacterid- 
174:1544-1553(1992), the contents of which is incorporated by reference for 

5 background purposes, identifies genes whose expression are essential for cell viability. 

In this technique transposons carrying controllable promoters, which provide 
transcription outward from the transposon in one or both directions, are generated. Random 
insertion of these transposons into target organisms and subsequent isolation of insertion 
mutants in the presence of inducer of promoter activity ensures that insertions which 

10 separate promoter from coding region of a gene whose expression is essential for cell 

viability will be recovered. Subsequent replica plating in the absence of inducer identifies 
such insertions, since they fail to survive. Sequencing of the flanking regions of the 
transposon allows identification of site of insertion and identification of the gene disrupted. 
Close monitoring of the changes in cellular processes/morphology during growth in the 

15 absence of inducer yields information on likely function of the gene. Such monitoring 
could include flow cytometry (cell division, lysis, redox potential, DNA replication), 
incorporation of radiochemically labeled precursors into DNA, RNA, protein, lipid, 
peptidoglycan, monitoring reporter enzyme gene fusions which respond to known cellular 
stresses. 

20 5) Generation of conditional lethal mutants by chemical mutagenesis. 

This technique is described by Beckwith, 1 Methods in Enzvmologv 204: 
3-18(1991), the contents of which are incorporated herein by reference for background 
purposes. In this technique random chemical mutagenesis of target organism, growth at 
temperature other than physiological temperature (permissive temperature) and subsequent 

25 replica plating and growth at different temperature (e.g. 42°C to identify ts, 25°C to identify 
cs) are used to identify those isolates which now fail to grow (conditional mutants). As 
above close monitoring of the changes upon growth at the non-permissive temperature 
yields information on the function of the mutated gene. Complementation of conditional 
lethal mutation by library from target organism and sequencing of complementing gene 

30 allows matching with unknown ORF. 
6) RT-PCR 

Streptococcus pneumoniae messenger RNA is isolated from bacterial infected tissue e.g. 48 
hour murine lung infections, and the amount of each mRNA species assessed by reverse 
transcription of the RNA sample primed with random hexanucleotides followed by PCR 
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with gene specific primer pairs. The determination of the presence and amount of a 
particular mRNA species by quantification of the resultant PGR product provides 
information on the bacterial genes which are transcribed in the infected tissue. Analysis of 
gene transcription can be carried out at different times of infection to gain a detailed 
5 knowledge of gene regulation in bacterial pathogenesis allowing for a clearer understanding 
of which gene products represent targets for screens for novel antibacterials. Because of 
the gene specific nature of the PCR primers employed it should be understood that the 
bacterial mRNA preparation need not be free of mammalian RNA. This allows the 
investigator to carry out a simple and quick RNA preparation from infected tissue to 
10 obtain bacterial mRNA species which are very short lived in the bacterium (in the order of 2 
minute halflives). Optimally the bacterial mRNA is prepared from infected murine lung 
tissue by mechanical disruption in the presence of TRIzole (GIBCO-BRL) for very short 
periods of time, subsequent processing according to the manufacturers of TRIzole reagent 
and DNAase treatment to remove contaminating DNA. Preferably the process is optimised 
1 5 by Finding those conditions which give a maximum amount of S.pneumoniae 16S 

ribosomal RNA as detected by probing Northerns with a suitably labelled sequence specific 
oligonucleotide probe. Typically a 5' dye labelled primer is used in each PCR primer pair 
in a PCR reaction which is terminated optimally between 8 and 25 cycles. The PCR 
products are separated on 6% polyacrylamide gels with detection and quantification using 
20 GeneScanner (manufactured by ABI). 

Each of these techniques may have advantages or disadvantage depending on the 
particular application. The skilled artisan would choose the approach that is the most 
relevant with the particular end use in mind. For example, some genes might be recognised 
as essential for infection but in reality are only necessary for the initiation of infection and 
25 so their products would represent relatively unattractive targets for antibacterials developed 
to cure established and chronic infections. 

Use of the of these technologies when applied to the ORFs of the present invention 
enables identification of bacterial proteins expressed during infection, inhibitors of which 
would have utility in anti-bacterial therapy. 
30 Streptococcus pneumoniae, strain 0 1 00993 has been deposited at the National 

Collection of Industrial and Marine Bacteria Ltd (NCIMB), Aberdeen, Scotland under 
NCIMB number 40794 on 1 1 April 1996, and a Streptococcus pneumoniae, strain 0100993 
DNA library in E. coli was similarly deposited on 17 April 1996 under NCIMB number 
40800. 



7 



PCT/US97/05306 

WO 97/37026 

The nucleotide sequences disclosed herein can be obtained by synthetic chemical 
techniques known in the art or can be obtained from Streptococcus pneumoniae, strain 
0100993by probing a DNA preparation with probes constructed from the particular 
sequences disclosed herein. Alternatively, oligonucleotides derived from a disclosed 
5 sequence can act as PCR primers in a process of PCR-based cloning of the sequence from a 
bacterial genomic source. It is recognised that such sequences will also have utility in 
diagnosis of the stage of infection and type of infection the pathogen has attained. 

To obtain the polynucleotide encoding the protein using the DNA sequence given 
herein typically a library of clones of chromosomal DNA of Streptococcus pneumoniae, 
10 strain 0100993 in £. coli or some other suitable host is probed with a radiolabelled 

oligonucleotide, preferably a 17mer or longer, derived from the partial sequence. Clones 
carrying DNA identical to that of the probe can then be distinguished using high stringency 
washes. By sequencing the individual clones thus identified with sequencing primers 
designed from the original sequence it is then possible to extend the sequence in both 
15 directions to determine the full gene sequence. Conveniently such sequencing is performed 
using denatured double stranded DNA prepared from a plasmid clone. Suitable techniques 
are described by Maniatis, T., Fritsch, E.F. and Sambrook, J. in MOLECULAR CLONING, 
A Laboratory Manual, 2nd edition, 1989, Cold Spring Harbor Laboratory (see: Screening 
By Hybridization 1.90 and Sequencing Denatured Double-Stranded DNA Templates 
20 13.70). 

A polynucleotide of the present invention may be in the form of RNA or in the 
form of DNA, which DNA includes cDNA, genomic DNA, and synthetic DNA. The DNA 
may be double-stranded or single-stranded, and if single stranded may be the coding strand 
or non-coding (anti-sense) strand. The coding sequence which encodes the polypeptide 
25 may be identical to the coding sequence shown or may be a different coding sequence 
which coding sequent, as a result of the redundancy or degeneracy of the genetic code, 
encoding the same polypeptide. 

The present invention includes variants of the hereinabove described 
polynucleotides which encode fragments, analogues and derivatives of the polypeptide 
30 characterized by the deduced amino acid sequence given herein. The variant of the 

polynucleotide may be a naturally occurring allelic variant of the polynucleotide or a non- 
naturally occurring variant of the polynucleotide. In addition to the standard A, G, C, T/U 
representations for nucleic acid bases, the term "N" is also used. "N" means that any of the 
four DNA or RNA bases may appear at such a designated position in the DNA or RNA 
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sequence, except that N cannot be a base that when taken in combination with adjacent 
nucleotide positions, when read in the correct reading frame, would have the effect of 
generating a premature termination codon in such reading frame. 

Thus, the present invention includes polynucleotides encoding the same 
S polypeptide characterized by the deduced amino acid sequence given herein as well as 
variants of such polynucleotides which variants encode for a fragment, derivative or 
analogue of the polypeptide. Such nucleotide variants include deletion variants, 
substitution variants and addition or insertion variants. 

The polynucleotide may have a coding sequence which is a naturally occurring 
10 allelic variant of the coding sequence characterized by the DNA sequence disclosed herein. 
As known in the art, an allelic variant is an alternate form of a polynucleotide sequence 
which may have a substitution, deletion or addition of one or more nucleotides, which does 
not substantially alter the function of the encoded polypeptide. 

The polynucleotide which encodes for the mature polypeptide, may include only 
IS the coding sequence for the mature polypeptide or the coding sequence for the mature 
polypeptide and additional coding sequence such as a leader or secretory sequence or a 
proprotein sequence. 

Thus, the term "polynucleotide encoding a polypeptide" encompasses a 
polynucleotide which includes only coding sequence for the polypeptide as well as a 
20 polynucleotide which includes additional coding and/or non-coding sequence. 

The present invention therefore includes polynucleotides, wherein the coding 
sequence for the mature polypeptide may be fused in the same reading frame to a 
polynucleotide sequence which aids in expression and secretion of a polypeptide from a 
host cell, for example, a leader sequence which functions as a secretory sequence for 
25 controlling transport of a polypeptide from the cell. The polypeptide having a leader 

sequence is a preprotein and may have the leader sequence cleaved by the host cell to form 
the mature form of the polypeptide. The polynucleotides may also encode for a proprotein 
which is the mature protein plus additional 5' amino acid residues. A mature protein having 
a prosequence is a proprotein and may be an inactive form of the protein. Once the 
30 prosequence is cleaved an active mature protein remains. 

Thus, for example, the polynucleotide of the present invention may code for a 
mature protein, or for a protein having a prosequence or for a protein having both a 
prosequence and a presequence (leader sequence). Further, the amino acid sequences 
provided herein show a methionine residue at the NH 2 -terminus. It is appreciated, however, 
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that during post-translational modification of the peptide, this residue may be deleted. 
Accordingly, this invention contemplates the use of both the methionine-containing and the 
methionineless amino terminal variants of each protein disclosed herein. 

The polynucleotides of the present invention may also have the coding sequence 
5 fused in frame to a marker sequence at either the 5* or 3' terminus of the gene which allows 
for purification of the polypeptide of the present invention. The marker sequence may be a 
hexa-histidine tog supplied by the pQE series of vectors (supplied commercially by Quiagen 
Inc.) to provide for purification of the polypeptide fused to the marker in the case of a 
bacterial host. 

10 The present invention further relates to polynucleotides which hybridize to the 

hereinabove-described sequences if there is at least 50% and preferably at least 70% identity 
between the sequences. The present invention particularly relates to polynucleotides, 
particularly Streptococcal polynucleotides, which hybridize under stringent conditions to 
the hereinabove-described polynucleotides . As herein used, the term "stringent conditions" 
15 means hybridization will occur only if there is at least 95% and preferably at least 97% 
identity between the sequences. The polynucleotides which hybridize to the hereinabove 
described polynucleotides in a preferred embodiment encode polypeptides which retain 
substantially the same biological function or activity as the polypeptide characterised by the 
deduced amino acid sequence given herein. An example of stringent hybridization 
20 conditions is overnight incubation at 42°C in a solution comprising: 50% formamide, 5x 
SSC (150mM NaCI, 15mM trisodium citrate), 50 mM sodium phosphate (pH7.6), 5x 
Denhardt's solution, 10% dextran sulfate, and 20 micrograms/ml denatured, sheared salmon 
sperm DNA, followed by washing the hybridization support in 0.1 x SSC at about 65°C. 
Hybridization and wash conditions are well known and exemplified in Sambrook, et a/., 
25 Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor. N.Y., 
(1989), particularly Chapter 1 1 therein. 

The invention also provides a polynucleotide consisting essentially of a 
polynucleotide sequence obtainable by screening an appropriate library containing the 
complete gene for a polynucleotide sequence selected from the group consisting of the 
30 polynucleotides of the Sequence Listing under stringent hybridization conditions with a 
probe having the sequence of said polynucleotide sequence set forth in said polynucleotide 
of the Sequence Listing or a fragment thereof; and isolating said DNA sequence. 
Fragments useful for obtaining such a polynucleotide include, for example, probes and 
primers described elsewhere herein. 

10 
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"Identity, " as known in the ait and used herein, is a relationship between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined by comparing 
the sequences. In the art, "identity" also means the degree of sequence relatedness between 
polypeptide or polynucleotide sequences, as the case may be, as determined by the match 
5 between strings of such sequences. "Identity" and "similarity" can be readily calculated by 
known methods, including but not limited to those described in (Computational Molecular 
Biology, Lesk, A.M., ed„ Oxford University Press, New York, 1988; Biocomputing: 
Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds., 
1 0 Humana Press, New Jersey, 1 994; Sequence Analysis in Molecular Biology, von Heinje, G„ 
Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., 
M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SIAM / Applied 
Math,, 48: 1073 (1988). Preferred methods to determine identity are designed to give the 
largest match between the sequences tested. Methods to determine identity and similarity 

1 5 are codified in publicly available computer programs. Preferred computer program methods 
to determine identity and similarity between two sequences include, but are not limited to, 
the GCG program package (Devereux, J., et a!.. Nucleic Acids Research 12(1): 387 (1984)), 
BLASTP, BLASTN, and FASTA (Atschul, S.R et al., / Molec. Biol. 215: 403-410 (1990). 
The BLAST X program is publicly available from NCBI and other sources (BLAST 

20 Manual, Altschul, S„ et al., NCBI NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. 
Mol. BioL 215: 403-410 (1990). 

As an illustration, by a polynucleotide having a nucleotide sequence having at least, 
for example, 95% "identity" to a reference nucleotide sequence selected from the group 
consisting of the polynucleotide of the Sequence Listing is intended that the nucleotide 

25 sequence of the polynucleotide is identical to the reference sequence except that the 

polynucleotide sequence may include up to five point mutations per each 100 nucleotides of 
the reference nucleotide sequence. In other words, to obtain a polynucleotide having a 
nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 5% of 
the nucleotides in the reference sequence may be deleted or substituted with another 

30 nucleotide, or a number of nucleotides up to 5% of the total nucleotides in the reference 
sequence may be inserted into the reference sequence. These mutations of the reference 
sequence may occur at the 5 or 3 terminal positions of the reference nucleotide sequence 
or anywhere between those terminal positions, interspersed either individually among 
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nucleotides in the reference sequence or in one or more contiguous groups within the 
reference sequence. 

Similarly, by a polypeptide having an amino acid sequence having at least, 
for example, 95% identity to a reference amino acid sequence of selected from the 
5 group consisting of the amino acids of the Sequence Listing is intended that the 
amino acid sequence of the polypeptide is identical to the reference sequence except 
that the polypeptide sequence may include up to five amino acid alterations per 
each 100 amino acids of the reference amino acid. In other words, to obtain a 
polypeptide having an amino acid sequence at least 95% identical to a reference 
10 amino acid sequence, up to 5% of the amino acid residues in the reference sequence 
may be deleted or substituted with another amino acid, or a number of amino acids 
up to 5% of the total amino acid residues in the reference sequence may be inserted 
into the reference sequence. These alterations of the reference sequence may occur 
at the amino or carboxy terminal positions of the reference amino acid sequence or 
15 anywhere between those terminal positions, interspersed either individually among 
residues in the reference sequence or in one or more contiguous groups within the 
reference sequence. 

The deposit referred to herein will be maintained under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Micro-organisms for purposes of 

20 Patent Procedure. These deposits are provided merely as convenience to those of skill in 
the art and are not an admission that a deposit is required under 35 U.S.C. §1 12. The 
sequence of the polynucleotides contained in the deposited material, as well as the amino 
acid sequence of the polypeptides encoded thereby, are incorporated herein by reference and 
are controlling in the event of any conflict with any description of sequences herein. A 

25 license may be required to make, use or sell the deposited material, and no such license is 
hereby granted. 

The terms "fragment," "derivative" and "analogue" when referring to the 
polypeptide characterized by the deduced amino acid sequence herein, means a polypeptide 
which retains essentially the same biological function or activity as such polypeptide. 
30 Thus, an analogue includes a proprotein which can be activated by cleavage of the 
proprotein portion to produce an active mature polypeptide. 

The polypeptide of the present invention may be a recombinant polypeptide, a 
natural polypeptide or a synthetic polypeptide, preferably a recombinant polypepdde. 
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The fragment, derivative or analogue of the polypeptide characterized by the 
deduced amino acid sequence herein may be (i) one in which one or more of the amino acid 
residues are substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or may not be 
5 one encoded by the genetic code, or (ii) one in which one or more of the amino acid 
residues includes a substituent group, or (iii) one in which the polypeptide is fused with 
another compound, such as a compound to increase the half-life of the polypeptide (for 
example, polyethylene glycol), or (iv) one in which the additional amino acids are fused to 
the polypeptide, such as a leader or secretory sequence or a sequence which is employed for 
10 purification of the polypeptide or a proprotein sequence. Such fragments, derivatives and 
analogues are deemed to be within the scope of those skilled in the an from the teachings 
herein. 

The polypeptides and polynucleotides of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 

IS The term "isolated" means that the material is removed from its original 

environment (e.g., the natural environment if it is naturally occurring). For example, a 
naturally-occurring polynucleotide or polypeptide present in a living animal is not isolated, 
but the same polynucleotide or polypeptide, separated from some or all of the coexisting 
materials in the natural system, is isolated. Such polynucleotides could be part of a vector 

20 and/or such polynucleotides or polypeptides could be pan of a composition, and still be 
isolated in that such vector or composition is not pan of its natural environment. 

The present invention also relates to vectors which include polynucleotides of the 
present invention, host cells which are genetically engineered with vectors of the invention 
and the production of polypeptides of the invention by recombinant techniques. 

25 In accordance with yet a further aspect of the present invention, there is therefore 

provided a process for producing the polypeptide of the invention by recombinant 
techniques by expressing a polynucleotide encoding said polypeptide in a host and 
recovering the expressed product. Alternatively, the polypeptides of the invention can be 
synthetically produced by conventional peptide synthesizers. 

30 Host cells are genetically engineered (transduced or transformed or transfected) 

with the vectors of this invention which may be, for example, a cloning vector or an 
expression vector. The vector may be, for example, in the form of a plasmid, a cosmid, a 
phage, etc. The engineered hos^ cells can be cultured in conventional nutrient media 
modified as appropriate for activating promoter,, reletting transforroants or amplifying the 
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genes. The culture conditions, such as temperature, pH and the like, are those previously 
used with the host cell selected for expression, and will be apparent to the ordinarily skilled 
artisan. 

Suitable expression vectors include chromosomal, nonchromosomal and synthetic 
5 DN A sequences, e.g., bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors 
derived from combinations of plasmids and phage DNA. However, any other vector may 
be used as long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a variety of 
procedures. In general, the DNA sequence is inserted into an appropriate restriction 
10 endonuclease site(s) by procedures known in the art. 

The DNA sequence in the expression vector is operatively linked to an appropriate 
expression control sequence(s) (promoter) to direct mRNA synthesis. As representative 
examples of such promoters, there may be mentioned: LTR or SV40 promoter, the E. coli 
lac or trp, the phage lambda Pl promoter and other promoters known to control expression 
1 5 of genes in eukaryotic or prokary otic cells or their viruses. The expression vector also 

contains a ribosome binding site for translation initiation and a transcription terminator. The 
vector may also include appropriate sequences for amplifying expression. 

In addition, the expression vectors preferably contain one or more selectable marker 
genes to provide a phenotypic trait for selection of transformed host cells such as 
20 dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such as 
tetracycline or ampicillin resistance in E. colt. 

The gene can be placed under the control of a promoter, ribosome binding site (for 
bacterial expression) and, optionally, an operator (collectively referred to herein as "control" 
elements), so that the DNA sequence encoding the desired protein is transcribed into RNA 
25 in the host cell transformed by a vector containing this expression construction. The coding 
sequence may or may not contain a signal peptide or leader sequence. The polypeptides of 
the present invention can be expressed using, for example, the E. coli tac promoter or the 
protein A gene (spa) promoter and signal sequence. Leader sequences can be removed by 
the bacterial host in post-translational processing. See, e.g., U.S. Patent Nos. 4,431.739; 
30 4,425,437; 4,338,397. Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 
appropriate vectors are PKK232-8 and PCM7. Particular named bacterial promoters 
include lad, lacZ, T3, T7, gpt, lambda P Rl P L and trp. Eukaryotic promoters include CMV 
immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and 

14 



WO 97/37026 PCT/US97/05306 
mouse metallothionein-I. Selection of the appropriate vector and promoter is well within the 
level of ordinary skill in the art. 

In addition to control sequences, it may be desirable to add regulatory sequences 
which allow for regulation of the expression of the protein sequences relative to the growth 
5 of the host cell. Regulatory sequences are known to those of skill in the art, and examples 
include those which cause the expression of a gene to be turned on or off in response to a 
chemical or physical stimulus, including the presence of a regulatory compound. Other 
types of regulatory elements may also be present in the vector, for example, enhancer 
sequences. 

10 An expression vector is constructed so that the particular coding sequence is located 

in the vector with the appropriate regulatory sequences, the positioning and orientation of 
the coding sequence with respect to the control sequences being such that the coding 
sequence is transcribed under the "control" of the control sequences (i.e., RNA polymerase 
which binds to the DNA molecule at the control sequences transcribes the coding 

15 sequence). Modification of the ceding sequences may be desirable to achieve this end. For 
example, in some cases it may be necessary to modify the sequence so that it may be 
attached to the control sequences with the appropriate orientation; i.e., to maintain the 
reading frame. The control sequences and other regulatory sequences may be ligated to the 
coding sequence prior to insertion into a vector, such as the cloning vectors described 

20 above. Alternatively, the coding sequence can be cloned directly into an expression vector 
which already contains the control sequences and an appropriate restriction site. 

Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly- 

25 expressed gene to direct transcription of a downstream structural sequence. The 
heterologous structural sequence is assembled in appropriate phase with translation 
initiation and termination sequences, and preferably, a leader sequence capable of directing 
secretion of translated protein into the periplasms space or extracellular medium. 
Optionally, the heterologous sequence can encode a fusion protein including an N-terminal 

30 identification peptide imparting desired characteristics, e.g., stabilization or simplified 
purification of expressed recombinant product. 

The vector containing the appropriate DNA sequence as hereinabove described, as 
well as an appropriate promoter or control sequence, may be employed 10 transform an 
appropriate host to permit the host to express the protein. 

15 
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More particularly, the present invention also includes recombinant constructs 
comprising one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a sequence of the invention 
has been inserted, in a forward or reverse orientation. In a preferred aspect of this 

5 embodiment, the construct further comprises regulatory sequences, including, for example, 
a promoter, operably linked to the sequence. Large numbers of suitable vectors and 
promoters are known to those of skill in the art, and are commercially available. The 
following vectors are provided by way of example. Bacterial: pET-3 vectors (Stratagene), 
pQE70. pQE60, pQE-9 (Qiagen). pbs. pDIO, phagescript, psiX174. pbluescript SK, pbsks. 

10 pNH8A, pNHlrja, pNH18A, pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3. 

pDR540, pRTT5 (Pharmacia). Eukaryotic: pBlueBacffl (Invitrogen), pWLNEO. pSV2CAT, 
pOG44, pXTl, pSG (Stratagene) pSVK3. pBPV, pMSG. pSVL (Pharmacia). However, 
any other plasmid or vector may be used as long as they are replicable and viable in the 
host. 

1 5 Examples of recombinant DN A vectors for cloning and host cells which they can 

transform include the bacteriophage 1 (£. coli), pBR322 (£. coli), pACYC177 (£. coli), 
pKT230 (gram-negative bacteria), pGVl 106 (gram-negative bacteria), pLAFRl (gram- 
negative bacteria). P ME290 (non-£. coli gram-negative bacteria), pHVH (£. coli and 
Bacillus subtilis), P BD9 (Bacillus), pU61 (Streptomyces), pUC6 (Streptomyces), YIp5 
20 (Saccharomyces), a baculovirus insect cell system, YCpl9 (Saccharomyces). See, 

generally. "DNA Cloning": Vols. I & II. Glover et al. ed. IRL Press Oxford (1985) (1987) 
and; T. Maniatis et al. ("Molecular Cloning" Cold Spring Harbor Laboratory (1982). 

In some cases, it may be desirable to add sequences which cause the secretion of the 
polypeptide from the host organism, with subsequent cleavage of the secretory signal. 
25 Polypeptides can be expressed in host cells under the control of appropriate 

promoters. Cell-free translation systems can also be employed to produce such proteins 
using RNAs derived from the DNA constructs of the present invention. Appropriate 
cloning and expression vectors for use with prokaryotic and eukaryotic hosts are described 
by Sambrook. et al.. Molecular Cloning: A Laboratory Manual. Second Edition, Cold 
30 Spring Harbor, N.Y.. (1989). the disclosure of which is hereby incorporated by reference. 

Following transformation of a suitable host strain and growth of the host strain to 
an appropriate cell density, the selected promoter is induced by appropriate means (e.g.. 
temperature shift or chemical induction) and cells are cultured for an additional period. 



1.6 



WO 97/37026 PCT/US97/05306 
Cells are typically harvested by centrifugation, disrupted by physical or chemical 
means, and the resulting crude extract retained for further purification. 

Microbial cells employed in expression of proteins can be disrupted by any 
convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use 
5 of cell lysing agents, such methods are well known to those skilled in the art. 

Depending on the expression system and host selected, the polypeptide of the 
present invention may be produced by growing host cells transformed by an expression 
vector described above under conditions whereby the polypeptide of interest is expressed. 
The polypeptide is then isolated from the host cells and purified. If the expression system 
10 secretes the polypeptide into growth media, the polypeptide can be purified directly from 
the media. If the polypeptide is not secreted, it is isolated from cell lysates or recovered 
from the cell membrane fraction. Where the polypeptide is localized to the cell surface, 
whole cells or isolated membranes can be used as an assayable source of the desired gene 
product. Polypeptide expressed in bacterial hosts such as E. coli may require isolation from 
15 inclusion bodies and refolding. Where the mature protein has a very hydrophobic region 
which leads to an insoluble product of overexpression, it may be desirable to express a 
truncated protein in which the hydrophobic region has been deleted. The selection of the 
appropriate growth conditions and recovery methods are within the skill of the art. 

The polypeptide can be recovered and purified from recombinant cell cultures by 
20 methods including ammonium sulphate or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite chromatography and 
lectin chromatography. Protein refolding steps can be used, as necessary, in completing 
configuration of the mature protein. Finally, high performance liquid chromatography 
25 (HPLC) can be employed for final purification steps. 

Depending upon the host employed in a recombinant production procedure, the 
polypeptides of die present invention may be glycosylated or may be non-glycosylated. 
Polypeptides of the invention may also include an initial methionine amino acid residue. 
A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that 
30 functions as an autonomous unit of DNa replication in vivo; i.e., capable of replication 
under its own control. 

A "vector" is a replicon, such as a plasmid, phage, or cosmid, to which another 
DNA segment may be attached so as to bring about the replication of the attached segment. 
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A "double-stranded DNA molecule" refers to the polymeric form of 
deoxyribonucleotides (bases adenine, guanine, thymine, or cytosine) in a double-stranded 
helix, both relaxed and supercoiled. This term refers only to the primary and secondary 
structure of the molecule, and does not limit it to any particular tertiary forms. Thus, this 

5 term includes double-stranded DNA found, inter alia, in linear DNA molecules (e.g., 

restriction fragments), viruses, plasmids, and chromosomes. In discussing the structure of 
particular double-stranded DNA molecules, sequences may be described herein according to 
the normal convention of giving only the sequence in the 5' to 3' direction along the 
nontranscribed strand of DNA (i.e.. the strand having the sequence homologous to the 

10 mRNA). 

A DNA "coding sequence of or a "nucleotide sequence encoding" a particular 
protein, is a DNA sequence which is transcribed and translated into a polypeptide when 
placed under the control of appropriate regulatory sequences. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA 
15 polymerase in a cell and initiating transcription of a downstream (3' direction) coding 

sequence. For purposes of defining the present invention, the promoter sequence is bound 
at the y terminus by a translation start codon (e.g.. ATG) of a coding sequence and extends 
upstream (5' direction) to include the minimum number of bases or elements necessary to 
initiate transcription at levels detectable above background. Within the promoter sequence 
20 will be found a transcription initiation site (conveniently defined by mapping with nuclease 
SI), as well as protein binding domains (consensus sequences) responsible for the binding 
of RNA polymerase. Eukaryotic promoters will often, but not always, contain "TATA" 
boxes and "CAT' boxes. Prokaryotic promoters contain Shine-Dalgarno sequences in 
addition to the -10 and -35 consensus sequences. 
25 DNA "control sequences" refers collectively to promoter sequences, ribosome 

binding sites, polyadsnylation signals, transcription termination sequences, upstream 
regulatory domains, enhancers, and the like, which collectively provide for the expression 
(i.e.. the transcription and translation) of a coding sequence in a host cell. 

A control sequence "directs the expression" of a coding sequence in a cell when 
30 RNA polymerase will bind the promoter sequence and transcribe the coding sequence into 
mRNA, which is then translated into the polypeptide encoded by the coding sequence. 

A "host cell" is a cell which has been transformed or transfected. or is capable of 
transformation or transfection by an exogenous DNA sequence. 
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A cell has been "transformed" by exogenous DNA when such exogenous DNA has 
been introduced inside the cell membrane. Exogenous DNA may or may not be integrated 
(covalently linked) into chromosomal DNA making up the genome of the cell. In 
prokaryotes and yeasts, for example, the exogenous DNA may be maintained on an 
5 episomal element, such as a plasmid. With respect to eukaryotic cells, a stably transformed 
or transfected cell is one in which the exogenous DNA has become integrated into the 
chromosome so that it is inherited by daughter cells through chromosome replication. This 
stability is demonstrated by the ability of the eukaryotic cell to establish cell lines or clones 
comprised of a population of daughter cell containing the exogenous DNA. 
10 A "clone" is a population of cells derived from a single cell or common ancestor by 

mitosis. A "cell line" is a clone of a primary cell that is capable of stable growth in vitro for 
many generations. 

A "heterologous" region of a DNA construct is an identifiable segment of DNA 
within or attached to another DNA molecule that is not found in association with the other 

15 molecule in nature. 

This invention is also related to the use of the polynucleotides of the invention for use as 
diagnostic reagents. Detection of a polynucleotide or polypeptide of the invention in a 
eukaryote, particularly a mammal, and especially a human, will provide a diagnostic method for 
diagnosis of a disease. Eukaryotes (herein also "individual(s)"), particularly mammals, and 

20 especially humans, infected with an organism comprising a polynucleotide of the invention may 
be detected at the nucleic acid level by a variety of techniques. 

Nucleic acids for diagnosis may be obtained from an infected individual's cells and 
tissues, such as bone, blood, muscle, cartilage, and skin. Genomic DNA may be used directly for 
detection or may be amplified enzymatically by using PCR or other amplification technique prior 

25 to analysis. RNA or cDNA may also be used in the same ways. Using amplification, 
characterization of the species and strain of prokaryote present in an individual, may be made by 
an analysis of the genotype of the prokaryote gene. Deletions and insertions can be detected by a 
change in size of the amplified product in comparison to the genotype of a reference sequence. 
Point mutations can be identified by hybridizing amplified DNA to labeled polynucleotide 

30 sequences of the invention. Perfectly matched sequences can be distinguished from mismatched 
duplexes by RNase digestion or by differences in melting temperatures. DNA sequence 
differences may also be detected by alterations in the electrophoretic mobility of the DNA 
fragments in gels, with or without denaturing agents, or by direct DNA sequencing. See, e.g., 
Myers et a!.. Science, 230: 1242 (1985). Sequence changes at specific locations also may be 
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revealed by nuclease protection assays, such as RNase and SI protection or a chemical cleavage 

method. See. e.g.. Cotton et aL. Proc. NatL Acad. Set. USA, 85: 4397-4401 

(1985). 

Cells carrying mutations or polymorphisms in the gene of the invention may also be 
5 detected at the DNA level by a variety of techniques, to allow for serotyping, for example. For 
example, RT-PCR can be used to detect mutations. It is particularly preferred to used RT-PCR 
in conjunction with automated detection systems, such as, for example, GeneScan. RNA or 
cDNA may also be used for the same purpose, PCR or RT-PCR. As an example, PCR primers 
complementary to a nucleic acid encoding a polypeptide of the invention can be used to identify 
10 and analyze mutations. These primers may be used for, among othe4r things, amplifying DNA 
isolated from a sample derived from an individual. The primers may be used to amplify the gene 
isolated from an infected individual such that the gene may then be subject to various techniques 
for elucidation of the DNA sequence. In this way. mutations in the DNA sequence may be 
detected and used to diagnose infection and to serotype and/or classify the infectious agent. 
15 The invention further provides a process for diagnosing, disease, preferably bacterial 

infections, more preferably infections by Streptococcus pneumoniae, comprising determining 
from a sample derived from an individual a increased level of expression of polynucleotide 
having a sequence set forth in the Sequence Listing or a sequence of the invention. 
Increased or decreased expression of a polynucleotide of the invention can be measured 
20 using any on of the methods well known in the art for the quotation of polynucleotides, 
such as. for example, amplification. PCR. RT-PCR. RNase protection. Northern blotting 
and other hybridization methods. 

In addition, a diagnostic assay in accordance with the invention for detecting over- 
expression of a polypeptide of the invention compared to normal control tissue samples may be 
25 used to detect the presence of an infection, for example. Assay techniques that can be used to 
determine levels of a polypeptide of the invention, in a sample derived from a host are well- 
known to those of skill in the art. Such assay methods include radioimmunoassays, competnive- 
binding assays. Western Blot analysis and ELISA assays. 

In accordance with yet a further aspect of the present invention, there is provided 
30 the use of a polypeptide of the invention for therapeutic or prophylactic purposes, for 
example, as an antibacterial agent or a vaccine. 

In accordance with another aspect of the present invention, there is provided the use 
of a polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 
genetic immunisation. 
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In accordance with yet another aspect of the present invention, there are provided 
inhibitors to such polypeptides, useful as antibacterial agents. In particular, there are 
provided antibodies against such polypeptides. 

Polypeptides of the invention may also be used to assess the binding of small molecule 
5 substrates and ligands in, for example, cells, cell-free preparations, chemical libraries, and natural 
product mixtures. These substrates and ligands may be natural substrates and ligands or may be 
structural or functional mimetics. See, e.g., Coligan et aL 9 Current Protocols in Immunology 
1(2): Chapter 5 (1991). 

The invention also provides a method of screening compounds to identify those which 

10 enhance (agonist) or block (antagonist) the action of polypeptides or polynucleotides of the 
invention, particularly those compounds that are bacteriostatic and/or bacteriocidal. The method 
of screening may involve high-throughput techniques. For example, to screen for agonists or 
antagoists, a synthetic reaction mix, a cellular compartment, such as a membrane, cell envelope 
or cell wall, or a preparation of any thereof, comprising a polypeptide of the invention and a 

15 labeled substrate or ligand of such polypeptide is incubated in the absence or the presence of a 
candidate molecule that may be an agonist or antagonist of a polypeptide of the invention. The 
ability of the candidate molecule to agonize or antagonize a polypeptide of the invention is 
reflected in decreased binding of the labeled ligand or decreased production of product from such 
substrate. Molecules that bind gratuitously, i.e., without inducing the effects of a polypeptide of 

20 the invention are most likely to be good antagonists. Molecules that bind well and increase the 
rate of product production from substrate are agonists. Detection of the rate or level of production 
of product from substrate may be enhanced by using a reporter system. Reporter systems that 
may be useful in this regard include but are not limited to colori metric labeled substrate 
converted into product, a reporter gene that is responsive to changes in polynucleotide or 

25 polypeptide activity, and binding assays known in the art. 

Another example of an assay for antagonists of a polypeptide of the invention is a 
competitive assay that combines such polypeptide and a potential antagonist with polypeptide- 
binding molecules, recombinant polypeptide- binding molecules, natural substrates or ligands, or 
substrate or ligand mimetics, under appropriate conditions for a competitive inhibition assay. 

30 Polypeptides of the invention can be labeled, such as by radioactivity or a colorimetric 
compound, such that the number of such polypeptide molecules bound to a binding molecule or 
converted to product can be determined accurately to assess the effectiveness of the potential 
antagonist. 
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Potential antagonists include small organic molecules, peptides, polypeptides and 
antibodies that bind to a polynucleotide or polypeptide of the invention and thereby inhibit or 
extinguish its activity. Potential antagonists also may be small organic molecules, a peptide, a 
polypeptide such as a closely related protein or antibody that binds the same sites on a binding 
5 molecule, without inducing an activities of a polypeptide of the invention, thereby preventing the 
action of such polypeptide by excluding it from binding. 

Potential antagonists include a small molecule that binds to and occupies the binding 
site of the polypeptide thereby preventing binding to cellular binding molecules, such that 
normal biological activity is prevented. Examples of small molecules include but are not limited 
10 to small organic molecules, peptides or peptide-like molecules. Other potential antagonists 
include antisense molecules (see Okano, J. Neurochem. 56: 560 (1991); 
OUGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE EXPRESSION, CRC 
Press, Boca Raton, FL (1988), for a description of these molecules). Preferred potential 
antagonists include compounds related to and variants of the polypetides of the invention. 
15 Each of the DNA sequences provided herein may be used in the discovery and 

development of antibacterial compounds. The encoded protein, upon expression, can be 
used as a target for the screening of antibacterial drugs. Additionally, the DNA sequences 
encoding the amino terminal regions of the encoded protein or Shine-Delgamo or other 
translation facilitating sequences of the respective mRNA can be used to construct antisense 
20 sequences to control the expression of the coding sequence of interest. 

The invention also provides the use of the polypeptide, polynucleotide or inhibitor 
of the invention to interfere with the initial physical interaction between a pathogen and 
mammalian host responsible for sequelae of infection. In particular the molecules of the 
invention may be used: in the prevention of adhesion of bacteria, in particular gram positive 
25 bacteria, to mammalian extracellular matrix proteins on in-dwelling devices or to 
extracellular matrix proteins in wounds; to block protein-mediated mammalian cell invasion 
by. for example, initiating phosphorylation of mammalian tyrosine kinases (Rosenshine et 
al. Infect. Immun. 60:2211 (1992); to block bacterial adhesion between mammalian 
extracellular matrix proteins and bacterial proteins that mediate tissue damage and; to block 
30 the normal progression of pathogenesis in infections initiated other than by the implantation 
of in-dwelling devices or by other surgical techniques. 

The antagonists and agonists of the invention may be employed, for instance, to inhibit 
and treat otitis media, conjunct.vitis. pneumonia, bacteremta, meningitis, sinusitis, pleural 
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empyema and endocarditis, and most particularly meningitis, such as for example infection of 
cerebrospinal fluid. 

Another aspect of the invention is a pharmaceutical composition comprising the 
above polypeptide, polynucleotide or inhibitor of the invention and a pharmaceutical^ 
5 acceptable carrier. 

In a particular aspect the invention provides the use of an inhibitor of the invention 
as an antibacterial agent. 

The invention further relates to the manufacture of a medicament for such uses. 

The polypeptide may be used as an antigen for vaccination of a host to produce 
10 specific antibodies which have anti-bacterial action. This invention also contemplates the 
use of the DNA encoding the antigen as a component in a DNA vaccine as discussed more 
fully below. 

The polypeptides or cells expressing them can be used as an immunogen to produce 
antibodies thereto. These antibodies can be, for example, polyclonal or monoclonal 
15 antibodies. The term antibodies also includes chimeric, single chain, and humanized 

antibodies, as well as Fab fragments, or the product of an Fab expression library. Various 
procedures known in the art may be used for the production of such antibodies and 
fragments. 

Antibodies generated against the polypeptides of the present invention can be 
20 obtained by direct injection of the polypeptides into an animal or by administering the 

polypeptides to an animal, preferably a nonhuman. The antibody so obtained will then bind 
the polypeptides itself. In this manner, even a sequence encoding only a fragment of the 
polypeptides can be used to generate antibodies binding the whole native polypeptides. 
Such antibodies can then be used to isolate the polypeptide from tissue expressing that 
25 polypeptide. 

Polypeptide derivatives include antigenically or immunologically equivalent 
derivatives which form a particular aspect cf this invention. 

The term 'antigenically equivalent derivative' as used herein encompasses a 
polypeptide or its equivalent which will be specifically recognised by certain antibodies 
30 which, when raised to the protein or polypeptide according to the present invention, 
interfere with the interaction between pathogen and mammalian host. 

The term 'immunologically equivalent derivative' as used herein encompasses a 
peptide or its equivalent which when used in a suitable formulation to raise antibodies in a 
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vertebrate, the antibodies act to interfere with the interaction between pathogen and 
mammalian host. 

In particular derivatives which are slightly longer or slightly shorter than the native 
protein or polypeptide fragment of the present invention may be used. In addition. 
5 polypeptides in which one or more of the amino acid residues are modified may be used. 
Such peptides may, for example, be prepared by substitution, addition, or rearrangement of 
amino acids or by chemical modification thereof. All such substitutions and modifications 
are generally well known to those skilled in the art of peptide chemistry. 

The polypeptide, such as an antigenically or immunologically equivalent derivative 
10 or a fusion protein thereof is used as an antigen to immunize a mouse or other animal such 
as a rat or chicken. The fusion protein may provide stability to the polypeptide. The 
antigen may be associated, for example by conjugation , with an immunogenic carrier 
protein for example bovine serum albumin (BSA) or keyhole limpet haemocyanin (KLH). 
Alternatively a multiple antigenic peptide comprising multiple copies of the protein or 
1 5 polypeptide, or an antigenically or immunologically equivalent polypeptide thereof may be 
sufficiently antigenic to improve immunogenicity so as to obviate the use of a carrier. 

For preparation of monoclonal antibodies, any technique which provides antibodies 
produced by continuous cell line cultures can be used. Examples include the hybridoma 
technique (Kohler and Milstein. MatUE, 256:495-497(1975)). the trioma technique, the 
20 human B-cell hybridoma technique (Kozbor et al., Immunology Today 4:72(1983)). and the 
EBV-hybridoma technique to produce human monoclonal antibodies (Cole, et al.. 1985, in 
Monoclonal Antibodies and Cancer Therapy. Alan R. Liss, Inc.. pp. 77-96). 

Techniques described for the production of single chain antibodies (U.S. Patent 
4.946.778) can be adapted to produce single chain antibodies to immunogenic polypeptide 

25 products of this invention. 

Using the procedure of Kohler and Milstein (supia. (1975)), antibody-containing 
cells from the immunised mammal are fused with myeloma cells to create hybridoma cells 
secreting monoclonal antibodies. 

The hybridomas are screened to select a cell line with high binding affinity and 
30 favorable cross reaction with other Streptococcal species using one or more of the original 
polypeptide and/or the fusion protein. The selected cell line is cultured to obtain the desired 
Mab. 

Hybridoma cell lines secreting the monoclonal antibody are another aspect of this 
invention. 
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Alternatively phage display technology could be utilised to select antibody genes 
with binding activities towards the polypeptide either from repertoires of PCR amplified v- 
genes of lymphocytes from humans screened for possessing anti-Fbp or from naive libraries 
(McCafferty, J. et a/., Nature 348:552-554(1990), and Marks, J. et a/., Biotechnology 
5 10:779-783(1992)). The affinity of these antibodies can also be improved by chain 
shuffling (Clackson, T. et aL, MatfflE 352:624-628(1991)). 

The antibody should be screened again for high affinity to the polypeptide and/or 
fusion protein. 

As mentioned above, a fragment of the final antibody may be prepared. 
10 The antibody may be either intact antibody of M r approx 150,000 or a derivative of 

it, for example a Fab fragment or a Fv fragment as described in Skerra, A and Pluckthun, 
A., Science 240:1038-1040 (1988). If two antigen binding domains are present each 
domain may be directed against a different epitope - termed 'bi specific' antibodies. 

The antibody of the invention may be prepared by conventional means for example 
15 by established monoclonal antibody technology (Kohler, G. and Milstein, C. (supra . ( 1975) 
or using recombinant means e.g. combinatorial libraries, for example as described in Huse, 
W.D. €/a/. f Science 246:1275-1281 (1989). 

Preferably the antibody is prepared by expression of a DNA polymer encoding said 
antibody in an appropriate expression system such as described above for the expression of 
20 polypeptides of the invention. The choice of vector for the expression system will be 

determined in part by the host, which may be a prokaryotic cell, such as E. coli (preferably 
strain B) or Streptomyces sp. or a eukaryotic cell, such as a mouse CI 27, mouse myeloma, 
human HeLa, Chinese hamster ovary, filamentous or unicellular fungi or insect cell. The 
host may also be a transgenic animal or a transgenic plant (for example, as described in 
25 Hiatt, A. et aL t NatUtt 340:76-78(1989). Suitable vectors include plasmids, bacteriophages, 
cosmids and recombinant viruses, derived from, for example, baculoviruses and vaccinia. 

The Fab fragment may also be prepared from its parent monoclonal antibody by 
enzyme treatment, for example using papain to cleave the Fab portion from the Fc portion. 

Preferably the antibody or derivative thereof is modified to make it less 
30 immunogenic in the patient. For example, if the patient is human the antibody may most 
preferably be 'humanised'; where the complimentarity determining region(s) of the 
hybridoma-derived antibody has been transplanted into a human monoclonal antibody , for 
example as described in Jones, P. et a/„ Natuie 321:522-525 (1986), or Tempest et a/., 
BiojficJuioJp^ (1991). 
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The modification need not be restricted to one of 'humanisation' ; other primate 
sequences (for example Newman, R. et al.. Biotechnology 10:1455-1460 (1992)) may also 
be used. 

The humanised monoclonal antibody, or its fragment having binding activity, form 
5 a particular aspect of this invention. 

This invention provides a method of screening drugs to identify those which 
interfere with the proteins selected as targets herein, which method comprises measuring the 
interference of the activity of the protein by a test drug. For example if the protein selected 
has a catalytic activity, after suitable purification and formulation the activity of the enzyme 
10 can be followed by its ability to convert its natural substrates. By incorporating different 
chemically synthesised test compounds or natural products into such an assay of enzymatic 
activity one is able to detect those additives which compete with the natural substrate or 
otherwise inhibit enzymatic activity. 

The invention also relates to inhibitors identified thereby. 
15 The use of a polynucleotide of the invention in genetic immunisation will 

preferably employ a suitable delivery method such as direct injection of ptasmid DNA into 
muscles (Wolff et al.. Mnl flenet. 1 :363 (1992); Manthorpe et al., Hlim , fans ThCT - 
4:419 (1963)), delivery of DNA complexed with specific protein carriers ( Wu et al., L 
piol Chem . 264:16985 (1989)). coprecipitation of DNA with calcium phosphate 
20 (Benvenisty & Reshef, Prpr NM'I Afflnl. Scj, USA. 83:9551 (1986)). encapsulation of DNA 
in various forms of liposomes (Kaneda et al.. SsWX. 243:375 (1989)). particle 
bombardment (Tang et al., Ual3X& 356:152 (1992)); Eisenbraun etal, PNA Cell Bio l , 
12:79 1 (1993)) and in vivo infection using cloned retroviral vectors (Seeger et al.. EtQC 
m <\ A ^ rf Sci. USA 81:5849 (1984)). Suitable promoters for muscle transfection include 
25 CMV. RSV, SRa, actin. MCK. alpha globin. adenovirus and dihydrofolate reductase. 

In therapy or as a prophylactic, the active agent i.e.. the polypeptide, polynucleotide 
or inhibitor of the invention, may be administered to a patient as an injectable composition, 
for example as a sterile aqueous dispersion, preferably isotonic. 

Alternatively the composition may be formulated for topical application 
30 for example in the form of ointments, creams, lotions, eye ointments, eye drops, ear drops, 
mouthwash, impregnated dressings and sutures and aerosols, and may contain appropriate 
conventional additives, including, for example, preservatives, solvents to assist drug 
penetration, and emollients in ointments and creams. Such topical formulations may also 
contain compatible conventional carriers, for example cream or ointment bases, and ethanol 
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or oleyl alcohol for lotions. Such carriers may constitute from about 1 % to about 98% by 
weight of the formulation; more usually they will constitute up to about 80% by weight of 
the formulation. 

For administration to human patients, it is expected that the daily dosage level of 
5 the active agent will be from 0.01 to 10 mg/kg, typically around 1 mg/kg. The physician in 
any event will determine the actual dosage which will be most suitable for an individual 
patient and will vary with the age, weight and response of the particular patient. The above 
dosages are exemplary of the average case. There can, of course, be individual instances 
where higher or lower dosage ranges are merited, and such are within the scope of this 
10 invention. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants 
may be employed to enhance the immune response. 

A suitable unit dose for vaccination is 0.5-5 ug/kg of antigen, and such dose is 
preferably administered 1-3 times and with an interval of 1-3 weeks. 
15 Within the indicated dosage range, no adverse toxicologicals effects are expected 

with the compounds of the invention which would preclude their administration to suitable 
patients. 

In order to facilitate understanding of the following example certain frequently 
occurring methods and/or terms will be described. 

20 "Plasmids" are designated by a lower case p preceded and/or followed by capital 

letters and/or numbers. The starting plasmids herein are either commercially available, 
publicly available on an unrestricted basis, or can be constructed from available plasmids in 
accord with published procedures. In addition, equivalent plasmids to those described are 
known in the art and will be apparent to the ordinarily skilled artisan. 

25 "Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction 

enzyme that acts only at certain sequences in the DNA. The various restriction enzymes 
used herein are commercially available and their reaction conditions, cofactors and other 
requirements were used as would be known to the ordinarily skilled artisan. For analytical 
purposes, typically 1 ug of plasrnid or DNA fragment is used with about 2 units of enzyme 

30 in about 20 ul of buffer solution. For the purpose of isolating DNA fragments for plasrnid 
construction, typically 5 to 50 ug of DNA are digested with 20 to 250 units of enzyme in a 
larger volume. Appropriate buffers and substrate amounts for particular restriction enzymes 
are specified by the manufacturer. Incubation times of about 1 hour al 37° C are ordinarily 



27 



WO 97/37026 

used, but may vary in accordance with the supplier's instructions. After digestion the 
reaction is electrophoresed directly on a polyacrylamide gel to isolate the desired fragment. 

Size separation of the cleaved fragments is performed using 8 percent 
polyacrylamide gel described by Goeddel. D. et al, (1980) Nucleic Acids Res.. 8:4057. 

5 "Oligonucleotides" refers to either a single stranded polydeoxynucleotide or two 

complementary polydeoxynucleotide strands which may be chemically synthesized. Such 
synthetic oligonucleotides have no 5' phosphate and thus will not ligate to another 
oligonucleotide without adding a phosphate with an ATP in the presence of a kinase. A 
synthetic oligonucleotide will ligate to a fragment that has not been dephosphorylated. 

10 "Ligation" refers to the process of forming phosphodiester bonds between two 

double stranded nucleic acid fragments (Maniatis, T.. et al., sapm. P- 146). Unless 
otherwise provided, ligation may be accomplished using known buffers and conditions with 
10 units to T4 DNA ligase ("ligase") per 0.5 ug of approximately equimolar amounts of the 
DNA fragments to be ligated. 

IS Example 1 

Isolation of DNA coding for a virulence gene in Streptococcus pneumoniae 

As mentioned above each of the DN As disclosed herein by virtue of the fact that it 
includes an intact open reading frame is useful to a greater or lesser extent as a screen for 
identifying antimicrobial compounds. A useful approach for selecting the preferred DNA 
20 sequences for screen development is evaluation by insertion-duplication mutagenesis. This 
system disclosed by Morrison et al.. L-Ba£l££i2l- 159:870 (1984). is applied as follows. 

Briefly, random fragments of Streptococcus pneumoniae, strain 0100993 DNA are 
generated enzymatically (by restriction endonuclease digestion) or physically (by sonication 
based shearing) followed by gel fractionation and end repair employing T4 DNA 
25 polymerase. It is preferred that the DNA fragments so produced are in the range of 200-400 
base pairs, a size sufficient to ensure homologous recombination and to insure a 
representative library in Exoli. The fragments are then inserted into appropriately tagged 
plasmids as described in Hensel el al.. Site 269: 400-403(1995). Although a number of 
plasmids can be used for this purpose, a particularly useful plasmid is pJDC9 described by 
30 Pearce et al., Mol Microbiol . 9:1037 (1993) which carries the erm gene facilitating 
erythromycin selection in either E. coli or S. pneumoniae previously modified by 
incorporation of DNA sequence tags into one of the poly linker cloning sites. The tagged 
plasmids are introduced into the appropriate S. pneumoniae strain selected, inter alia, on the 
basis of serotype and virulence in a murine model of pneumococcal pneumonia. 
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It is appreciated that a seventeen amino acid competence factor exists (Havastein et 
al., EtQ£. Mafl. Acad. SsL USA 92: 1 1 140-44 (1995)) and may be usefully employed in this 
protocol to increase the transformation frequencies. A proportion of transformants are 
analysed to verify homologous integration and as a check on stability. Unwanted levels of 
5 reversion are minimized because the duplicated regions will be short (200-400 bp), however 
if significant reversion rates are encountered they may be modulated by maintaining 
antibiotic selection during the growth of the transformants in culture and/or during growth 
in the animal. 

The 5. pneumoniae transformants are pooled for inoculation into mice, eg., Swiss 
10 and/or C57B 1/6. Preliminary experiments are conducted to establish the optimum 
complexity of the pools and level of inoculum. A particularly useful model has been 
described by Veber et al. (J. Antimicrobiol. Chemother (1993) in which 10 5 cfu 

inocula sizes are introduced by mouth to the trachea. Strain differences are observed with 
respect to onset of disease e.g.,3-4 days for Swiss mice and 8-10 days for C57B1/6. 

15 Infection yields in the lungs approach 10 8 cfu/lung. IP administration is also possible when 
genes mediating blood stream infection are evaluated. Following optimization of 
parameters of the infection model, the mutant bank normally comprising several thousand 
strains is subjected to the virulence test. Mutants with attenuated virulence are identified by 
hybridization analysis using the labelled tags from the "input" and "recovered" pools as 

20 probes as described in Hensel si al„ Sfikncs 269: 400-403( 1 995). S. pneumoniae DNA is 
colony blotted or dot blotted, DNA flanking the integrated plasmid is cloned by plasmid 
rescue in £. coli (Morrison et al. t J, BacterjoJ. 159:870 (1984)) and sequenced. Following 
sequencing, the DNA is compared to the nucleotide sequences given herein and the 
appropriate ORF is identified and function confirmed for example by knock-out studies. 

25 Expression vectors providing the selected protein are prepared and the protein is configured 
in an appropriate screen for the identification of anti-microbial agents. Alternatively, 
genomic DNA libraries are probed with restriction fragments flanking the integrated 
plasmid to isolate full-length cloned virulence genes whose function can be confirmed by 
"knock-out" studies or other methods, which are then expressed and incorporated into a 

30 screen as described above. 

The individual full length sequences given herein are summarized in the following 
Tables (Table 1 and Table 2). Under the column in Table 1 labeled "Identity" there is the 
deduced identity of each open reading frame of the invention determined using Blastp 
and/or MPSearch. The ORF# column indicates whether the polynucleotide encoding each 
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ORF encodes more than one ORF. For example. SEQ ID NO:224 has an ORF# 1 but no 
#2; thus the polynucleotide encodes this ORF#l. but no other ORF was detected. On the 
other hand, SEQ ID NO:225 and 226 have ORF#s 1 and 2 respectively, indicating that they 
were both encoded by the same polynucleotide. This can also be seen in Table 2 where the 

5 polynucleotide of SEQ ID NO: 2 encodes the ORFs of SEQ ID NOS: 225 and 226. Table 1 
also shows the start ("START" column) and stop ("STOP" column) codons for each ORF 
and their positions in the encoding polynucleotide sequence. The SEQ ID NOS of the 
polypeptides of this table are linked to both a deduced identity in this table and a 
polynucleotide sequence in Table 2 which encodes each polypeptide. The "Direction" 

10 column in Table 1 shows the direction of the ORF encoding each poypetide in this table. 
"Forward" denotes the sense orientation and "Reverse" denotes the antisense orientation of 
the ORF. 
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IABLEJ 







i 


Codon 


1 


Position 






SEQ ID NO: ] 


dentity < 


3RF# ! 


Start 1 


Stop ! 


Start ! 


Stop 1 


Direction 


224. 1 
1 


Phosphate Transport ATP-Binding Protein 
PSTB. - Escherichia 


■ 


-CAT ' 


rcA- ; 


1 


553 1 


Reverse 


225. ] 


probable ATP binding protein - Bacillus 
subtilis 


1 


-CAT ' 


rcA- 


38 


781 


Reverse 


226. 


Nopaline Transport System Permease 
Protein Nocm. - AjErobacterium Tumcfaciens 


2 


-CAG 


TTA- 


781 


1512 


Reverse 


227. 


Aspartate Aminotransferase A (EC 2.6.1 1) 
(Transaminase A) (ASPAT.) - Rhizobium 
Meiiloti. 


1 


ATG 


TAG 


480 


671 


Forward 


228. 


ISL2 protein - Lactobacillus helveticus 


1 


ATG 


TAA 


549 


947 


Forward 


229. 


ISL2 protein - Lactobacillus helveticus 


2 


ATG 


TAG 


889 


1353 


Forward 


230. 


Unknown 


3 


ATG 


TGA 


1421 


1570 


Forward 


231. 


Unknown 


2 


-CAT 


CTA- 


111 


413 


Reverse 


232. 


Unknown 


1 


ATG 


TGA 


133 


282 


Forward 


233. 


Acyl Carrier Protein ( ACP.) - Rhizobium 
Meiiloti. 


1 


-CAT 


CTA- 


673 


906 


Reverse 


234. 


Unknown 


1 


-CAT 


TTA- 


15 


137 


Reverse 


235. 


Unknown 


2 


-CAT 


CTA- 


681 


989 


Reverse 


236. 


Sulfate Transport System Permease Protein 
CYST (Fragment). - Synechocystis SP. 
(Strain PCC 6803). 


2 


-CAT 


TTA- 


336 


689 


Reverse 


237. 


probable transposase (insertion sequence 
IS861) - Streptococcus agalacuae (str ain 

coH-n 


2 


-CAT 


TCA- 


149 


454 


Reverse 


238. 


Unknown 


2 


-CAG 


TTA- 


567 


851 


Reverse 


239. 


"PTS SYSTEM 


1 


ATG 


TAA 


49 


477 


Forward 


240. 


H PTS SYSTEM 


2 


-CAT 


TTA- 


151 


789 


Reverse 


241. 


nitrogen fixation protein (nifS) homotog - 
Haemophilus influenzae (strain Rd KW 20) 




-CAT 


TCA- 


1 


207 


Reverse 


242. 


ceilobiose phosphotransferase system celA • 
Bacillus stearothermophilus 


1 


-CAT 


TCA- 


84 


431 


Reverse 


243. 


surface protein PspA - Streptococcus 
pneumoniae 


1 


ATG 


TAA 


22 


321 


Forward 


244. 


Unknown 


2 


ATG 


TAA 


272 


469 


Forward 


245. 


ATV-dependent Clp proteinase (EC 3.4.21.-) 
chain cipL - Lactococcus lactis subs p. lactis 


1 


ATG 


TGA 


83 


760 


Forward 
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1 


slasmid pUCL22 


1 








246. 


Beta-Glucosidase (EC3.2.1.21) 
[Gemiobiase) (Cellobiase) ) (Beta-D- 
Glucoside GL Ucohydrolase) (Amygdalasc). 
- Bacillus Subtilis. 


1 


-CAT 


TCA- 


153 


599 


Reverse 


247. 


Glycine Betaine/L-Proline Transport ATP - 
Binding Protein Prov. - Escherichia Col. I. 


2 


ATG 


TAA 


159 


887 


Forward 


248. 


Unknown 


3 


-CAT 


TTA- 


1102 


1254 


Reverse 


249. 


30S Ribosomal Protein SI 1 (BSll). - 
Bacillus Subtilis. 


1 


ATG 


TGA 


15 


164 


Forward 


250. 


DNA - Directed RNA Polymerase Alpha 
Chain (EC 2.7.7.6) (Transcriptase Alpha 
Chain). - Bacillus Subtilis. 


2 


ATG 


TAA 


282 


1217 


Forward 


251. 


Peptide Chain Release Factor 3 (RF-3). - 
Bacteroides Nodosus (Dichelobactcr Nod 
Osus). 


1 


ATG 


TAG 


212 


667 


Forward 


252. 


Unknown 


1 


ATG 


TAG 


267 


353 


Forward 


253. 


Riboflavin Synthase Alpha Chain (EC 
2.5.1.9). - Bacillus Subtilis. 


2 


-CAT 


CTA- 


213 


662 


Reverse 


254. 


Unknown 


3 


-CAT 


TTA- 


833 


1045 


Reverse 


255. 


Unknown 


1 


-CAT 


TTA- 


83 


424 


Reverse 


256. 


mesl protein - Leuconostoc mesenteroides 


2 


ATG 


TAG 


448 


1302 


Forward 


257. 


Transacetylase BMTD (EC 2.3.1.-). - 
Bacillus Subtilis. 


1 


CTG 


TAG 


3 


320 


Forward 
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Codon 




Position 






SEQ ID NO: 


Identity 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


258. 


nbosc-phosphate pyrophosphokinase (EC 
2.7.6.1.) - Bacillus caldolvttcus 


1 


CTG 


TAG 


1 


642 


Forward 


259. 


Unknown 


1 


ATG 


TAA 


66 


614 


Forward 


260. 


Unknown 


1 


ATG 


TAG 


108 


590 


Forward 


261. 


nitrogenase C (nifC) homolog - 
Haemophilus influenzae (strain Rd KW20) 


2 


ATG 


TAG 


631 


855 


Forward 


262. 


Unkown 


2 


ATG 


TAA 


606 


752 


Forward 


263. 


Unknown 


2 


ATG 


TGA 


280 


495 


Forward 


264. 


Unknown 


2 


ATG 


TGA 


639 


1466 


Forward 


265. 


Acetyl Esterase (EC 3.1 .-.-). - Caldocelluro 
Saccharolvticum. 


2 


ATG 


TAA 


274 


594 


Forward 


266. 


Unknown 


1 


-CAT 


TCA- 


2 


157 


Reverse 


267. 


Triosephosphaste Isomerase (EC 5.3. 1.1) 
(TIM). - Bacillus Subtilis. 


1 


-CAT 


TTA- 


270 


665 


Reverse . 


268. 


Branched -chain Amino Acid 
Aminotransferase (EC 2.6. 1 .42) 
(Transaminase B). - ES Cherichia Coli. 


1 


ATG 


TAG 


no 


736 


Forward 


269. 


branched-chain-amino-acid transaminase 
homolog - Haemophilus influenzae (strain 
Rd KW20) 


2 


ATG 


TAA 


708 


842 


Forward 


270. 


DnaK protein - Lactococcus lactis 


I 


CTG 


TGA 


3 


749 


rurwoiu 


271. 


Ketol- Acid Reductoisomerase (EC 1.1.1 .86) 
(Acetohydroxy-Acid Isomeroreductase). - 
Lactococcus Lactis (Subsp. Lactis) 
(Streptococcus Lactis) 


1 


ATG 


TAA 


99 


428 


Forward 


272. 


Unknown 


1 


-CAT 


CTA- 


278 


631 




273. 


Ainidophosphoribosy [transferase Precursor 
(EC 2.4.2.14) (Glutamtne Phosphonbosy 
Lpyrophosphaste Amidotransf erase) (Atase). 
- Bacillus Subtilis. 




-CAT 


CTA- 


152 


775 


Reverse 


274. 


Pyrrolidone-Carboxylate Peptidase (EC 
3.4.19.3) (5-Oxoprolyl-Peptidase). - STR 
Eptococcus Pvosenes. 


1 


-CAT 


TCA- 


156 


803 


Reverse 


275. 


50S Ribosoraal Protein LI 6. - Mycoplasma 
Caphcolum. 


1 


ATG 


TAA 


33 


416 


Forward 


276. 


serine O-acetyltransferase (EC 2.3.1.301 - 
Bacillus stearothermophilus 


2 


-CAT 


CTA- 


577 


1194 


Reverse 


277. 


Unknown 


1 


ATG 


TAG 


61 


648 


Forward 
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|278. 1 Unknown 1 




CAT C 


TA- 1 


65 3 


35 If 


Reverse 


1279. Lipoprotein Signal Peptidase (EC 3.4.23.36) 1 
(Prolipopiotein Signal Peptidase) (Signal 
Peptidase II) (Spase ID. - Staphylococcus 

I I Aureus. 




iTG 1 


*AA 5 


0 5 


I / r 


: orward 


7Rn Unknown - . 


> 


VTG 1 


rAA 1 


114 i 


S34 I 


: orward 


1281. \i 
1 1 


Upha-Acetolactate Decarboxylase (EC 1 
1.1.1.5). -Bacillus Subtilis. 


I 


-CAT < 


JTA- 1 


04 < 


145 1 


Reverse 


no l 
[Zo*. 1 

1 


DihvrimHinicolinate Synthase 9EC 4.2.1.52) ' 
DHDPS). - Bacillus Subtilis. 


J 


-CAT ' 


rcA- < 


575 


&84 1 


Reverse 


1 


Priivrihrtnurleotide Nucleotidvltransferase 
[EC 2.7.7.8) (Polynucleotide Phosphor 
Ylase) (Pnoase). * Photorhabdus 
Luminescent — — 


1 


-CAT ' 


rcA- 


1 


855 


Reverse 


284. 


Single-Strand Binding Protein (SSB) (Helix- 
Destabilizing Protein). - Bacillus S Ubtilis 


I 


-CAT 


TTA- 


128 


598 


Reverse 


285. 


ATP-Dependent CLP Protease ATO- 
Binding Subunit CLPX. - Escherichia Coli. 


2 


-CAT 


CTA- 


195 


482 


Reverse 


1286. 


ATP-Dependent CLP Protease ATO- 
Bindina Subunit CLPX. - Escherichia Cott_ 


4 


-CAG 


TCA- 


676 


990 


Reverse 


287. 


N-(5-Phosphoribosyl) Anthranilate 
Isomerase (EC 5.3.1.24) (PRA1). - 
Lactococcus Lactis (Subsp. Lactis) 
(Streptococcus Lactis). _ 


1 


ATG 


TAA 


83 


712 


Forward 


288. 


30S Ribosomal Protein S2. - Escherichia 
Coli. 


1 


ATG 


TGA 


277 


591 


Forward 




289. 


30S Ribosomal Protein S2. - Escherichia 


3 


ATG 


TGA 


918 


10641 Forward 

1 


290. {possible beta-gaiaciosi**** precursor 


1 


-CAT 


CTA- 


120 


545 I Reverse 


291, Unknown 


1 


-CAT 


TTA- 


77 


283 


Reverse 


292. Unknown _ 


1 


-CAT 


TTA- 


I 


198 


Reverse 


293. | H Phospho-2-Dehydfo-3-Deoxyheptonate 
1 Aldolase . — 


1 


-CAT 


TTA- 


217 


690 


Reverse 
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< 




] 


Position 






cvn in NO* 1 




0RF# ! 


Stmt I 


Stop ! 


Start 


Stop 


direction 


294. I 


Unknown 


2 


-CAT ' 


IT A- - 


*29 


599 


Reverse 


295. 1 


Jnknown 


3 


-CAT * 


TCA- ' 


739 


936 


Reverse 


296. 


t ■ . . i fs ^ 1 i — - 111 t i\ / T 

Sorbitol Dehydrogenase (EC 1.1.1.14) (L- 
Iditol 2-Dehydrogenase). « Bacillus Sub 

Till* 


1 


ATT 

ATU 


TA A 
IAA 






Forward 


297. 


EBG Operon Repressor. - Escherichia Coli. 


2 


-CAT 


CTA- 


668 


1060 


Reverse 


298. 


cellobiose phosphotransferase system celA - 
Bacillus stearothermophilus 


2 


A T 

-CAT 


tt a 
TTA- 


Z4V 


3 OO 


Reverse 


299. 


Unknown 


3 


A T 

-CAT 


TCA- 


CO t 

DO! 


Vo4 


Reverse 


300. 


Adenylosuccinate Lyase (EC 4.3.2.2) 
(Adenylosuccinate) (ASL). - Bacillus Subtil 
Is. 


1 


-CAT 


TTA- 


99 


809 


Reverse 


301. 


ATr-Binaing Protein ofcXA. - Haemophilus 
Influenzae. 


i 
1 


ATT* 

AlU 




1 1 1 


Af\A 


Forward 




L- Lactate uenyarogenase (cytocnrome) (tx. 
1.1.2.3). - Escherichia Coli. 


•> 
z 


AlU 


TA A 




JU / 


Forward 


303. 


Phosphate Transport System Permease 
Protein PSTC. - Escherichia Coli. 


3 


ATG 


TGA 


507 


1070 


Forward 


304. 


Sulfate Transport ATP- Binding Protein 
i#ysa* ~ aynecnococcus or. v oirain i\-v- /v*f 


1 


PAT 

-LAI 


PTa 


1 m 




Reverse 


in*. 


uiuuiown 


•> 


-PAT 


TTA- 




1077 
11// / 


Reverse 


JUU. 


Unknown 


& 


„f AT 






7AQ 


Reverse 


30.7. 


Unknown 


2 


ATG 


TAG 


1108 


1290 


Forward 


JUo. 


Phosphoribosyiformylglycinamidine Cyclo- 

T tone* /"PP ft 1 W AIP9A fPhncnkrtrihn 

Svl*Aminoimidazole Synthetase) ( AIR 
Synthase). - Bacillus Subtilis. 


I 


TAT 

~\-A 1 


TT* A 


JJ 1 


All 


Reverse 


309. 


Unknown 


2 


ATG 


TAA 


313 


765 


Forward 


310. 


SMS Protein. - Escherichia Coli. 


1 


-CAT 


TCA- 


102 


416 


Reverse 


311. 


sisma 42 protein - Emerococcus faecal is 




-CAT 


TCA- 


8 


487 


Reverse 


312. 


Unknown 




-CAT 


TTA- 


76 


390 


Reverse 


313. 


Cuanylate Kinase (EC 2.7.4.8) (GMP 
Kinase). - Escherichia Colt. 


2 


-CAG 


TTA- 


415 


849 


Reverse 


314 


Unknown 


1 


-CAT 


TCA- 


51 


296 


Reverse 


315. 


Unknown 


1 


ATG 


TAA 


175 


285 


Forward 


316. 


Uknown 


2 


ATG 


TAA 


361 


558 


Forward 


317 


Uknown 


3 iATG 


TAA 


383 


1144 


Forward 
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318. 


PTS Svstem - 


) 


-CAT 1 


rcA- i 


66 * 


t65 1 


Reverse 


319. 1 
I 


*-Fucose isomerase (EC 5.3. 1 .-) - 
Escherichia Coli. 


1 


-CAT < 


7TA- < 


) i 


(82 ] 


Reverse 


320. 1 


Jnknown 


I 


-CAT ' 


ITA- * 


$95 < 


S50 1 


Reverse 


321. \ 


Unknown 


1 


\TG ' 


rGA 


130 


231 1 


: orward 


322. 


J-Oxoacyl-IAcyl-Canicr Proteinl Reductase 
(EC 1.1.1.100) (3-Ketoacyl- Acyl Car Rier 
Protein Reductase), - Escherichia Coli. 


1 


-CAT ' 


rcA- 


51 


200 


Reverse 


323. 


3-Oxoacyl-lAcyl-Caxrier Protein] Reductase 
(EC 1.1.1.100) (3-Ketoacyl- Acyl Car Rier 
Protein Reductase). • Escherichia Coli. 


2 


-CAT 


TCA- 


231 


6)4 


Reverse 


324. 


2-lsopropylmalate Synthase (EC 4.1.3.12) 
(Alpha-lsopropytmalate Synthase) (Alph A- 
IPM Synthetase). - Lactococcus Lactis 
(Subsp. Lactis) (Streptococcus Lactis). 


1 


-CAT 


TTA- 


31 


231 


Reverse 


325. 


2-Isopropylmaiate Synthase (EC 4. 1 .3.12) 
(Alpha-isopropylmalaie Synthase) (Alph A- 
EPM Synthetase). - Lactococcus Lactis 
(Subsp. Lactis) (Streptococcus Lactis). 


2 


-CAT 


TCA- 


191 


334 


Reverse 


326. 


Unknown 


j 


-CAT 


TCA- 


309 


452 


Reverse 


327. 


2-Isopropylmalate Synthase (EC 4.1.3.12) 
(Alpha-isopropylmalate Synthase) (Alph A- 
IPM Synthetase). - Lactococcus Lactis 
(Subsp. Lactis) (Streptococcus Lactis). 


4 


-CAT 


TTA- 


495 


1127 


Reverse 


328. 


Unknown . 


2 


-CAT 


TCA- 


170 


469 


Reverse 


329. 


Unknown , 


1 


ATG 


TAG 


105 


473 


Forward 


330. 


Strscaa Ncbi gi: 3I0629NCB1 gi: 473 - 
Streptococcus gordonii (strain PK488) DNA 


1 


-CAG 


CTA- 


81 


665 


Reverse 
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Codon 




Position 






SEO ID NO: 


Identity 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


331. 


Unknown 


1 


ATG 


TGA 


228 


374 




332. 


lysyl aminopcptidase (EC 3.4.1 1.15) 
precursor - Laccococcus lactis 


1 


-CAT 


TCA- 


107 


766 


Reverse 


333. 


Indole-3-Glycerol Phosphate Synthase (EC 
4.1.1.48) (IGPS). - Lactococcus Lactis 
(SUBSP. Lactis) (Streptococcus Lactis) 


1 


-CAT 


TCA- 


127 


369 


Reverse 


334. 


Anthranilate Phosphoribosyltransferase (EC 
2.4.2. 18).-Lactococcus Lactis (Sub SP. 
Lactis) (Streptococcus Lactis). 


2 


-CAA 


TCA- 


366 


641 


Reverse 


335. 


Tagatose-6-Phosphate Kinase (EC 2.7.1.-) 
(Phosphotagatokinase). - Lactococcus L 
Actis (Subsp. Lactis) (Streptococcus Lactis) 


1 


ATG 


TGA 


42 


524 


Forward 


336. 


Unknown 


1 


ATG 


TGA 


73 


474 




337. 


Unknown 


1 


-CAT 


TTA- 


32 


286 


Reverse 


338. 


Unknown 


1 


ATG 


TGA 


306 


572 


Forward 


339. 


Acetolactate Synthase Large Subunit (EC 
4.1.3.18) (AHAS) (Acetphydroxy-Acid Syn 
thase Large Subunit) (ALS). - Lactococcus 
Lactis (SUBSP. Lactis) (Streptococc us 
Lactis). 


I 


ATG 


TAG 


59 


502 


Forward 


340. 


Penicillin-Binding Proteins I A/IB. - 
Bacillus Subtilis. 


2 


ATG 


TGA 


535 


720 


Forward 


341, 


Unknown 


I 


ATG 


TAG 


165 


488 'Forward 


342. 


Anthranilate Phosphoribosyltransferase (EC 
2.4.2.18). - Lactococcus Lactis (SUB SP. 
Lactis) (Streptococcus Lactis). 


1 


-CAT 


TCA- 


5 


394 


Reverse 


343. 


ffrpE protein - Lactococcus Lactis 


2 


ATG 


TAA 


124 


543 


Forward 


344. 


Unknown 


t 


-CAT 


CTA- 


34 


195 


Reverse 


345. 


Unknown 


2 


-CAT 


TTA- 


377 


*>44 


Reverse 




Unknown 


2 


ATG 


TAA 


159 


503 


Forward 


347. 


Unknown 


2 


-CAT 


CTA- 


315 


644 


Reverse 


348. 


Unknown 


2 


-CAT 


CTA- 


468 


587 


Reverse 


349 


Unknown 


2 


-CAT 


CTA- 


482 


631 


Reverse 


350. 


Unknown 


2 


-CAT 


TTA- 


380 


757 


Reverse 


351. 


NIFS Protein Homoiog (Fragment). - 
Lactobacillus Delbrueckii (SUBSP. 
Bulgarians). 


I 


ATG 


TAG 


87 


797 


Forward 


352. 


"Ornithine Carbamoyl transferase 


1 


TTG 


TAA 


2 


502 


Forward 
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I353. I 


>SEGLI NCB1 gi: 499660 - Pseudomonase 1 
)Utida. 


|~ 


•CAT 1 


[TA~ f 


i0 4 


t81 ? 


leverse 


.1 

I354. 1 


Jnknown 




JTG n 


rAA 


1 : 


522 1 


-orward 


I355. 


rhioredoxin. - Streptomvces Clavuligerus. 




\TG ' 


rAG i 


58 


575 1 


Forward 


356. 1 

I 


D-alanine permease (dagA) homolog - 
Haemophilus influenzae (strain Rd KW20) 


1 1 


r\IVJ 


rAG 


185 


343 1 






Unknown . 


1 


^vrt 1 


TTA- 


78 


371 




|357. 
[358. 


"DNA Polymerase III 




-CAA 


TCA- 


82 


561 


Reverse 


359. 


Transport ATP-Binding Protein Coma. - 
Streptococcus Pneumoniae. 


1 




1 l/V" 




552 




1360. 


Unknown — 


L 


A IO 


TAA 


445 


540 




|36l. 


surface protein PspA - Streptococcus 


1 


-CAT 


TCA- 


45 


416 


Reverse 


362. 


Licid Protein. - Haemophilus Influenzae 


2 


ATG 


TAA 


268 


633 


Forward ' 


1363. 




2 


ATG 


TAA 


527 


640 


Forward 


1364. 


Glutamine Transport ATP- Binding Protein 
GLNO. - Escherichia CoJL m 


I 


CTG 


TGA 


1 


393 


Forward . 




Unknown , 


1 


ATG 


TAG 


184 


303 


Forward 


365. 

|366. 


Unknown ^ . . 


1 


ATG 


TGA 


794 


919 


Forward 


367. 


Lipoamide Dehydrogenase Component (E3) 
of Pyruvate Dehydrogenase Complex (EC I 
8.1.4) (Dihydrolipoamide Dehydrogenase). 
Azotobacter Vinelandii. _ 


1 


ATG 


TAA 


3 


416 


Forward 
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Codon 




Position 


i 




SEQ ID NO: 


Identity 


ORF# 


Start 


Stop 


Start 


(9W|t 


lit >>^yiiij>i 


368. 


Orotate Phosphoribosyltransferasc (EC 
2.4.2.10) (OFRT). - Bacillus Subtilis. 


1 


-CAT 


CTA- 


21 


311 


Reverse 


JQ7. 


Unknown 


i 
l 


AlVf 


T*A A 

TAA 


28 


309 


Forward 


J/U. 


orOuD- Associated GTr-oinoing rtroein. - 
□acinus ouDtius. 


l 


A TV" 

ATG 


TGA 


110 


538 


Forward 


J/ 1. 


i ransport A 1 r-oinauig rrotein CUM A. - 
o ircpiococ cus riicumoniaC . 


l 


-CAA 


TCA- 


47 


679 


Reverse 


372. 


prephenate dehydrogenase (EC 1.3.1.12) - 
lacujcoccus iscus. 


2 


ATG 



TAG 


271 


492 


Forward 




uoyaiogiycoprotein cnaopepuaase itu 

H&emoivtica. 


I 


CTG 


T" A A 

TAA 


1 


450 


Forward 


374. 


Unknown 


j 


-CAT 


TTA~ 

1 1 r\ 


7 




Reverse . 


375. 


Unknown 


] 


TTG 


TAA 
inn 


*) 




Forward 


376. 


"DNA Polymerase III 


1 


ATG 


TGA 


j 


40 1 

HUl 


Forward 


377. 


Unknown 


2 


ATG 


TAA 


Jv 1 


J JO 


rOrWBTO 


378. 


Dihvdrodinicnhnaie Synthase fFC 4 2 1 51\ 
(DHDPS). - Bacillus Subtilis. 


1 


-CAT 






** J / 


Reverse 


379. 


Unknown 


2 


-CAT 




](\A 




Reverse 


380. 


Unknown 


2 


-CAT 


TCA- 




640 

0*f 


Reverse 


381. 


Unknown 


1 


-CAT 


TTA- 


JO 




Reverse 


382. 


Unknown 


2 


-CAT 


TTA- 


254 


409 


Reverse 


383. 


Unknown 




ATG 


TAG 


lift 

t JO 


oi / 


Forward 


384. 


Unknown 


2 


-CAT 


TTA- 


225 


479 


Reverse 


385. 


Unknown 


2 


ATG 


TAA 


747 


857 


Forward 


386. 


Unknown 


1 


ATG 


TAG 


294 


443 


Forward 


387. 


u ii Known 


L 


ATT. 

A1U 


TV** A 


356 


544 


Forward 


388. 


Unknown 


\ 


-CAT 


CTA- 


4 


141 


Reverse 


389. 


Unknown 


I 


CTG 


TAG 


1 


579 


Pnru/nrrt 
rui wtuu 


390. 


Unknown 


2 


-CAT 


TTA- 


309 


452 


Reverse 


391. 


Phosphopentomutase (EC 5.4.2.7). - 
Escherichia Coli. 


1 


ATG 


TAA 


3 


233 


Forward 


392. 


D-Alanyl-D-Aianme Carfooxypeptidase 
Precursor (EC 3.4.16.4) (DD-Peptidase)(DD 
• CarboxypeptidaseXCpase)(PBP5). - 
Bacillus Subtilis. 


1 


-CAT 


TTA- 


52 


537 


Reverse 


393. 


Na+ and Cl-dependem gamma-aminobutryic 
acid transporter homoiog-Haemophilus 


1 


TTG 


TGA 


2 


268 


Forward 
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i 


nfluenzae (strain Rd KW20) 














394 ] 


Unknown 


2 


ATG 


TGA 


319 


546 


Forward 


395. i 


dihydrolipoamide dehydrogenase (EC 
1 .8. 1 .4) - Pelobacter carbinolicus 


1 


ATG 


TGA 


3 


284 


Forward 


396. 


Unknown 


2 


ATG 


TGA 


241 


450 


Forward 


397. 


Beta-Glucosidase A (EC 
3^.1.21XGcntiobiaseXCellobiaseKBeta-D- 
Glucoside Glucohydrolase). - Clostridium 
Thermocellum. 


1 


ATG 


TAA 


184 


453 


Forward 


398. 


Uracil Permease. - Bacillus Caldolvticus. 


I 


ATG 


TAA 


93 


353 


Forward 


399. 


Unknown 


2 


ATG 


TAG 


127 


516 


Forward 


400. 


ligoendopeptidase F- Lactococcus lactis 


l 


ATG 


TGA 




110 


r\Ji Waiu 


401. 


Stratpasea NCB1 gi: 1S3565NCB1 gi: 4- 
Streptococcus Faecalis DNA. 


2 


ATG 


TAA 


392 


568 


Forward 


402. 


Unknown 


2 


ATG 


TAA 


376 


507 


Forward 


403. 


Exodeoxyribonuclease Small Subunit (EC 
3.1.1 1.6) (Exonuciease VII Small Subunit). - 
Escherichia Coli. __ 


2 


-CAT 


TCA- 


470 


682 


Reverse 


404. 


ATP-Depenem DNA Helicase RECG (EC 
3.6.1.-). - Escherichia Coli. 


1 


ATG jTGA 


3 


455 


'Forward 
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Codon 




Position 






SEQ ID NO: 


Identity 


ORF# 


Start 


Stop 


Start 


Stop 




405. 


Possible thiamin biosynihetic enzvme 


1 


-CAT 


TTA- 


15 


347 


Reverse 


406. 


SPOU Protein. - Escherichia Coli. 


2 


ATG 


TAA 


322 


618 


Forward 


407. 


Malnnvl PV*a. Ar>vl C*arrim* Pmimn 
1'iotuiiji v~ua~/\i-yi v<u i icr r i uicin 

Coli. 


i 


ATT! 


IJvJ 






Forward 


408. 


Unknown 


I 


-CAT 


i /\— 




1 is 

I 10 




Reverse 


409. 


nucleoside dirjhosnhate kinase ( ndk « 
homolog- Haemophilus influenzae (strain 
RdKW20) 


1 


CTG 


TRA 


1 

I 




Forward 


410. 


Nucleoside Diphosphate Kinase (EC 
2.7.4 6MND1G (NDP Kinase) - Escherichia 
Co LI. 


2 


ATG 


TAG 


215 


481 


Forward 


411. 


Unknown 




-CAT 


TTA- 


21 




Reverse 


412. 


Unknown 




-CAT 


TCA- 


162 


314 


Reverse 


413. 


Unknown 




ATG 


TAA 


187 


417 


rorwaro 


414. 


Unknown 




ATG 


TGA 


316 


417 


Fr*r\i/*rH 


415. 


Unknown 




ATG 


TGA 


316 


453 


Forward 


416. 


Enolase (EC 4 2 11 1 V2-Phosnhoplvcerate 
Dehydratase)(2-Phospho-D0Glycerate 
Hydro-Lvase). - Bacillus Subtilis. 




-CAT 


TTA- 


4 


*Oj 


Reverse 


417. 


Unknown 




CTG 


TGA 


1 


363 


Forward 


418. 


Unknown 




ATG 


TGA 


39 


383 


Forward 


419. 


PILB Protein. - Neisseria Gonorrhoeae. 




-CAT 


TTA- 


145 


327 


Reverse 


420. 


Unknown 




ATG 


TGA 


285 


533 


Forward 


421. 


Unknown 


1 


CTG 


TAG 


2 


379 


Forward 


422. 


imegrase/recombinase (xprB) homolog - 
Haemophilus influenzae (strain Rd KW20) 


1 


ATG 


TAA 


305 


421 


Forward 


423. 


Unknown 




-CAT 


CTA- 


173 


436 


Reverse 


424. 


Unknown 


1 


-CAT 


CTA- 


182 


427 


Reverse 


425. 


Unknown 




ATG 


TAA 


49 


372 


Forward 


426. 


"Mutator Mutt Protein (7 




-CAT 


CTA- 


21 


446 


Reverse 


427. 


Unknown 




ATG 


TGA 


177 


380 


Forward 


428. 


Possible phosphatase 




ATG 


TAG 


112 


402 


Forward 


429. 


Unknown 




ATG 


TGA 


110 


271 


Forward 


430. 


Phosphoenolpymvate Carboxylase (EC 
4.1.1.31). - Corvnebactenum Glutarrucum 




ATG 


TGA 


337 


561 


Forward 


431. 


Unknown 




ATG 


TGA 


214 


324 


Forward 


432. 


50S Ribosomal Protein L31. - Bacillus 




ATG 


TAA 


142 


426 


Forward 
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SubUiis. _ 














433. 


glycosyl transferase (IgtD) homolog - 
Haemophilus influenzae (strain Rd KW20) 


1 


AIu 




IZ i 


J77 


lUI will u 


Aid 


Unknown 


2 


-CAT 


CTA- 


244 


435 


Reverse 


lUnknown , 


*> 


ATG 


TGA 


118 


309 


Forward 


436. 


Factor essential for Expression of 
Methicillin Resistance. - Staphylococcus 
Aur Eus. 


2 


-CAT 


TTA- 


168 


434 


Reverse 


437. 


Unknown . 


1 


-CAT 


TCA- 


2 


214 


Reverse 


438. 


Unknown - 


1 


TTG 


TAA 


2 


142 


Forward 


439. 


Unknown 


1 


ATG 


TAA 


30 


191 


Forward 
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Codon 


1 Position 






SlCtV/ 1" NU: 


identity 


ORF ff 


Start 


Stop 


Start 


Stop 


Direction 


440. 


Aspartate Aminotransferase ( EC 2.6. 1.1) 
(Transaminase A K AS PAT). - Bacillus SP. 
(STRAIN YM-2V 




-CAT 


TCA- 


63 


416 


Reverse 


441. 


Unknown 




ATG 


TAA 


52 


342 


Forward 


442. 


Unknown 




-CAT 


TCA- 


210 


455 


Reverse 


443. 


Unknown 




TTG 


TGA 


2 


517 


Forward 


444. 


ATP-Dependent DNA Helicase RECG 
(EC 3.6. 1 .-) - Eschenchia Coh. 




ATG 


TGA 


83 


376 


Forward 


445. 


Unknown 




ATG 


TAA 


70 


Jo* 


Forward 


446. 


Cell Division Protein FTSA. - Bacillus 
Subtilis. 




TTG 


TAA 


3 


3 / 1 


Forward 


447. 


Unknown 




ATG 


TAG 


70 


AA 1 
**■» 1 


Forward 


448. 


Unknown 




ATG 


TAG 


104 


454 


Forward 


449. 


Unknown 




CTG 


TAA 


1 


159 


Forward 


450. 


I Inlrnnwn 




ATT! 


TAA 


120 


J47 


Forward 


451. 


I Ink nnum 




A TTl 
Alu 


TGA 


31 


423 


Forward 


All 


I TnVnnu/n 






TGA 


225 


A > C 


Forward 


453. 


Unknown 




ATG 


TGA 


290 


418 


Forward 


•tjH. 


unknown 




ATG 


TGA 


3 


269 


Forward 




mxcin \3r\jiiu Associaiea r roiein ). - 

Bacillus ^iihfilic 






TGA 


93 


365 


Forward 


456. 


3 2^ 20V3-Methvladenine-DNA Glvcncvlas 
EI 




ATT, 


TAG 


91 




Forward 


457. 


"Glucan 1 




-CAT 


TTA- 


4 


150 


Keverse 


458. 


Unknown 




-CAT 


CTA- 


245 


400 


Reverse 


459. 


Glutamate/Aspartate Transport ATP* 
Binding Protein GLTL. - Escherichia Coli. 




-CAT 


TCA- 


81 


218 


Reverse 


460. 


Unknown 




-CAT 


TTA- 


103 


492 


Reverse 


461. 


Unknown 




ATG 


TGA 


305 


484 


Forward 


462. 


Unknown 




-CAT 


TCA- 


29 


355 


Reverse 


463. 


Unknown 


2 


-CAT 


TTA- 


572 


838 


Reverse 


464. 


Unknown 


2 


-CAT 


TCA- 


652 


1026 


Reverse 


465. 


Unknown 


2 


-CAT 


TTA- 


318 


764 


Reverse 


466. 


Unknown 


2 


ATG 


TGA 


719 


805 


Forward 


467. 


Llcpyrda NCB1 gi: 5 11014 - Lactococcus 
lactis. 


1 


ATG 


TGA 


134 

i 


472 


Forward 
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468. 


Unknown 


r 

I 1 


ATG 


TGA 


385 


492 


Forward 


469. 


Unknown 


2 


ATG 


TAA 


587 


721 


Forward 


470. 


salE protein - Neisseria meningitidis 




-CAT 


TCA- 


23 


460 


Reverse 


471. 


Unknown 


2 


-CAG 


TTA- 


717 


1319 


Reverse 


472. 


Naphthoate Synthase (EC 4.1.3.36) 
(Dihydroxynaphthoic Acid 
SynthetaseKDhna Sy Nthetase). - 
Escherichia Coli. 


1 


ATG 


TGA 


97 


414 


Forward 
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Codon 




Position 






SEQ ID NO: 


Identity 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


473. 


Unknown 


1 


CTG 


TGA 


1 


246 


Forward 


474. 


Unknown 


1 


-CAT 


TCA- 


341 


748 


Reverse 


475. 


Unknown 


1 


-CAT 


TTA- 


217 




Reverse 


476. 


Unknown 


1 


-CAT 


TTA- 


499 


729 


Reverse 


477. 


Multiple Sugar-Binding Transport ATP- 
Binding Protein MSMK. - Streptococcus 
MUTANS. 


2 


ATG 


TAA 


407 


571 


Forward 
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Table 2 shows the cofrelation between the SEQ ID NO of each DNA sequence of 
the invention with the SEQ ID NO(S) of polypeptide or polypeptides that its open reading 
frames) encodes. For example, the DNA of SEQ ID NO:l encodes one polypeptide, that 
of SEQ ID NO:224. Whereas, the DNA of SEQ ID NO:2 encodes two polypeptides, the 
polypeptides of SEQ ID NO:225 and SEQ ID NO.226. 



TABLE 2 



DNA 


Protein (oDen reading frame) 






1 


224 




2 


225,226 




3 


227 




4 


228,229,230 




5 


231 




6 


232 




7 


233 




8 


234 




9 


235 




10 


236 




11 


237 




12 


238 





13 


239 




14 


240 




15 


241 




16 


242 




17 


243.244 




18 


245 




19 


246 




20 


247,248 




21 


249.250 




22 


251 




23 


252 




24 


253.254 




25 


255.256 




26 


257 




27 


258 




28 


259 




■ 29_ 


260,261 




30 


262 
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31 


263 




32 


264 




33 


265 




34 


266 




35 


267 




36 


268.269 




37 


270 




38 


271 




39 


272 




40 


27^ 




41 


274 




42 


27 5 




43 


276 




44 


277 




45 


278 




46 


279 




47 


280 




48 


281 




49 


282 




50 


283 




51 


284 




52 


285 286 




53 


287 




54 


288 2RQ 




55 


290 




56 


291 




57 


292 




58 


293 




59 


294,295 




60 


296,297 




61 


298,299 




62 


300 




63 


301 




64 


302.303 




65 


304,305 




66 


306 




67 


307 




68 


308 




69 


309 




70 


310 




71 


311 




72 


312,313 
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HI 

15 


114 




HA 

74 


lie 117 




/J 


lift 




/O 


no i?n 




HH 






78 


197 ioi 




HC\ 


17A 17S 196 197 




8U 






81 


no 
3zy 




82 


330 




83 


331 




84 


332 




85 


AAA AM > 

333,334 




86 


n c 

335 




87 


336 




88 


337 




89 


338 




90 


339 




91 


*\ Af\ 

340 




92 


341 




93 


342 




94 


i /• i 
343 




95 


344,345 




96 


346 




97 


347 




98 


*> AO 

348 




99 


349 




100 


1C/\ 

350 




101 


351 




102 


352 




103 


1 CI 

353 




104 


354 




105 


*J cc 

355 




106 


356 




107 


1<*7 

33/ 




108 


ICQ 

358 




109 


359 




110 


360 




111 


361 




Ii2 


362 




113 


363 




114 


364 
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115 


365 


116 


366 


117 


367 


118 


368 


119 


369 


120 


370 


121 


371 


122 


372 


123 


373 


124 


374 


125 


375 


126 


376 


127 


377 


128 


378 


129 


379 


130 


380 


131 


381 


132 


382 


133 


383 


134 


384 


135 


385 


136 


386 


137 


387 


138 


388 


139 


389 


140 


390 


141 


391 


142 


392 


143 


393 


144 


394 


145 


395 


146 


396 


147 


397 


148 


398 


149 


399 


150 


400 


151 


401 


152 


402 


153 


403 


154 


404 


155 


405 


156 


406 
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157 


407 


— 


158 


408 




159 


409,410 




160 


411 




161 


412 




162 


413 




163 


414 




164 


415 




165 


416 




166 


417 




167 


418 




168 


419 




169 


420 




170 


421 




171 


422 




172 


423 




173 


424 





174 


425 




175 


426 




176 


427 




177 


428 




178 


429 




179 


430 




180 


431 




181 


432 




182 


433 




183 


434 




184 


435 




185 


436 


. 


186 


437 




187 


438 




188 


439 




189 


440 




190 


441 




191 


442 




192 


443 




193 


444 




194 


445 




195 


446 




196 


447 




197 


448 




198 


449 





50 



WO 97/37026 



PCT/US97/05306 



1QQ 




700 


451 


201 


All 


202 


453 


203 


454 


204 


455 


205 


456 


706 


457 


707 

ZU f 


45ft 


ZUo 




LKTi 




710 
ZIU 


*fOI 


ZI J 


AAO A£Sl 


0X0 
Z1Z 




ZI J 


*K>J 


OiA 

zJ4 


40O 


*> i ^ 
ZI D 


40/ 


ZIO 


4Oo,40y 


0\H 
Zl / 


AH{\ AH\ 
4 /U,4 / 1 


218 


472 


219 


473 


220 


474 


221 


475 


222 


476 


223 


477 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

<i) APPLICANT: Black, Michael 
Hodgson, John 
Knowles , David 
Nicholas, Richard 
Stodola, Robert 

(ii) TITLE OF THE INVENTION: Novel Compounds 

(iii) NUMBER OF SEQUENCES: 477 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham Corporation 

(B) STREET: 709 Swede land Road 

(C) CITY: King of Prussia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19406-0939 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 01-APR-1997 

(C) CLASSIFICATION: 

(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUME2R: 60/014690 

(B) FILING DATE: 02-APR-1996 

PRIOR APPLICATION DATA : 

(A) APPLICATION NUMEER : 60/025768 

(B) FILING DATE: 22-AUG-1996 

(viii) ATTORNEY / AGENT INFORMATION: 
(A) NAME: Gimmi , Edward R 
{B) REGISTRATION NUMBER. 38,891 
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(C) REFERENCE / DOCKET NUMBER: P50466 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-270-4478 

(B) TELEFAX: 610-270-5090 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



CTCAAACATG 


GTCTCTTCTA 


GTTGCATGGT 


CGCAATCGGA 


TCCAAGGCTG 


AGGCTGGGCT 


60 


CATCCATTAA 


GAGGATATCT 


GGCTTAACAG 


AGATGGCACG 


AGCGATACAG 


AGACGTTGTT 


120 


GCTGACCACC 


TGATAAGGTC 


AAGGCTGACT 


TGTGGAGATC 


GTCTTTAACC 


TGATCCCAAA 


180 


GGGCAGCCTG 


ACTAAGGGAG 


GTTTCTACGA 


TTTCATCTAG 


GACTTGCTTA 


TCCTTAACTC 


240 


CAGCACGTTC 


ATGCGCAAAG 


GTAATATTAC 


GGTAAATTGA 


CTTAGCAAAT 


GGATTGGGGC 


300 


GTTGAAAAAC 


CATTCCAATG 


TGTTTACGCA 


TTTCATAAAC 


GTTGATTTCT 


GGACGGTTGA 


360 


CATCAATTCC 


ACGATAGAGA 


ATCTGCCCAG 


TTACTTTAGC 


AATATCAATG 


GTATCATTCA 


420 


TGCGATTGAG 


ACTGCGTAAG 


TAGGTAGATT 


TCCCCGATCC 


CGACGGACCA 


ATCAAAGCTG 


480 


TAATTTTATT 


TCCTTTCAAA 


TTGCATATCA 


ATCCCCTTAA 


TGGATTCATT 


TTTACCATAG 


540 


TAAACATGGA 


CATCCTTAGT 


AGAAAGGGCT 


ACTTTTTCTT 


CAGGAAAGGT 


AAGGATATGC 


600 


TTCTCATCCC 


AGTTATATGT 


TGACATGGCT 


TCTCCTTTAG 


GCAGCGGTTA 


ATTTCTTGTG 


660 


TAGATAGCTT 


CCGAACTTAC 


GAG 








683 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
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CTAGTCCAAC TAACTGAGTT TTCCTTTATC TATTATATCA AATATAAGTC CGTTTGTAAC 60 
TAGTGAAGAA TTCTTTTGTC CGCTCTTCTT TAGGGGTGTG GATAATCTCA TCCGGAGTTC 120 
CAGACTCGAT GATTTTCCCC TTATCTAAGA AGAGAATTTT ATCCGCAACT TGGGCTACAA 180 
AGGGCATGTC ATGACTGACC AAAATCATGG TCTGACCTGA CTTAGCAGCA TCTGCAATAG 240 
ACTTTTCTAC TTCACCGACC AATTCTGGGT CAAGGGCTGA AGTTGGTTCG TCTAAGAGCA 300 
AAACATCTGG TTTCATAGCA AGCGCACGCG CTAGGGCAAC CCGTTGCTTC TGTCCACCTG 360 
ATAAATGGCG AGGATAATGG TTTTCACGGT CCGAAAGCCC AACCTTAGCC AACTCTTCCT 420 
TGGCAATCTT AGTCGCTTCT TGGTCAGATA ATTTCTTGAC AACAACCAAG CCTTCTTTCA 480 
CATTATCAAG TGCTGTTCGG CGTTCAAATA AATTAAACTG TTGGAAAACC ATAGACAACT 
TACGGCGTAG GGCAAGGATT TCTTCTTGAG TGATTTTAGA AAAATCAACT GAAAAACCAT 
CAATCTGAAT AGAGCCACTG TCAGGTGTTT CAAGATAATT GAGACTGCGA AGAAAGGTTG 
ATTTTCCAGC TCCTGAAGAA CCAATCAAGG CTACAACTTC CCCTTTTTGA ATATCCAAGT 
TCAGATGATC CAAGACAGTC TGTCCTGAAA AGGATTTGCT TAAATTCGAA ATCTTAATCA 
TTAACGAAGG TCTCCTTTCA CATCTGTTTG CACTGTATCA GGTGCAGAAA TAGCCATTTT 
TCTCTCGATG AAACGACCGA GGCTTTCAAT TCCGATATTG ACTACCCAAT AAACAAGGGC 
AACAGAGATG AAGCGTTCAA AATAGCGATA ATCAGCTCCA CCTAGAATCT GAGCTTGGGC 
AAAGACTTCC ACAACACCCG CACTAAAAGC TAGAGATGTT CCCTTGGTCA AACCGATGAG 
GGAATTAATC AAGGTTGGAG TAGCTACCAC CGCTGCATTA GGAATAATCA CTCGTCGATA 
AACTTGCGCT CGGGTCATAC CCAGACTGCG CGCCGCCTCA ATCTCACCAG GATTAACTGA 
GAGAATGGCT GCACGAATGG TTTCACTAGC ATAAGCTGCC TCATTAAAGG CAAAAGCGAC 
AATCGCAAAA GCTGCAGCTG GAATCGCATT GATATTGAGA CCAGTTCCCC ATTGCTGATT 
GAGGGCTTTC AAAGCCAAAG GGATTCCGTA GTAGGTCAAC ATGAGTTGCA CCAAAATCGG 
TGTCCCTTTT AAGAAACTAA CAAAGAAGGC CTGCAAGGGA TATAAAATCT TGACACGATT 
GATCTTCACA ATGGCAAAAA GAAGCGCCAA AACCAAGCCA AAAAGGGCAC CGCCAATTGT 
CAACATAATT GTTGTTGGAA GTTGTTGGAC AATTCTAGGG ATTCCATCAA AGACCGAACG 
TAGGCTAAAC AG 

(2) INFORMATION FOR SEQ ID NO: 3: 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1512 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

" (ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



CTGGAGATTA TTGAGATTGC CCGTCAAAAC GATTTGATTA TNTTTGCGGA TGAAATCTAT 
GAC r GCATGG TAAATGGACG GACATGTGCA TAACGCCTGT GGCGAGCTTG GCACCAGATG 
TCTTCTGTGT CAGCATGAAT GGTCTGTCAA AATCCCACCG CATAGCAGGT TTCCCGTGTG 
GGATGGATGG TCTTGTCTGG CCCTAAGACT CATG1TAAGG GCTATATCGA AGGGCTCAAT 



60 
120 
180 
240 
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ATGCTGTCCA 


ATATGCGCCT 


TTGCTCTAAC 


GTTTTGGCTA 


CCAACGCTGC 


ATTAGGAATA 


300 


ATCACTGTCG 


TACAAACTTG 


CGCTTGGGGG 


GTCACCAATC 


AGTCGATGAA 


TTGCTTCTTC 


360 


CTGGTGGACG AATCTACGAG 


CAAAGAAATT 


TCATCTATAA 


TGCCATTCAA GATATTCCAG 


420 


GTTTGTCTGC 


CGTTAAACCC 


AAGGCGGGGC 


TCTATATCTT 


CCCAAAAATC 


GACCGCAATA 


480 


TGTACCGTAT 


CGATGATGAT 


GAGCAGTTTG 


TCCTTGATTT 


CTTGAAGCAG 


GAAAAGGTTC 


540 


TCTTGGTTCA 


TGGTCGAGGC 


TTTAACTGGC 


AGGAACCAGA 


CCACTTCCGT 


ATCGTTTACC 


600 


TTCCTCGTGT 


TGATGAGTTA 


GCCCAAATCC 


AAGAAAAGAT 


GACTCGTTTC 


TTGAAACAGT 


660 


ATCGTAGATA 


GGGCTTGCAT 


TCGAAAAAGC 


TGGAAACATT 


TGCCTAGAG 




709 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



CTGCTTCAGA 


ACCTGCCAAA 


CCAGCACCGA 


TAACATTGAT 


ATAAGATTGA GACACGACAC 


60 


TAATACCTCT 


TTGGGAGTGT 


GAAGTTAAGA 


TTCACATTGA 


AAAAGCCAAT 


CAGACTTACA 


120 


AGCTTTCGAG 


TTTCTTGGCT 


CAGGCTGAAA 


AAGTCCACAG 


GGCTTTTNCA 


CTCCCACAAA 


180 


TCTTTCTATT 


TTTTCTNCTA 


CTAGTATAAC 


AAAAAAAGGG 


AAGAAGGNAA 


ACTTCCCTGT 


240 


TTAGTCATTT 


TCTTGATGTA 


AAGAGATAGT 


GAGTATTCCA 


GTTAAGAATC 


AATTATATGC 


300 


TACTCTATAA 


AATCTTTTCC 


ACATAACGGA 


TCGATAGGGA 


CTGTTATTCT 


ATCTCTTGCT 


360 


ATAACCGTAT 


TATCTAAAAC 


AGCATAACAT 


TCAACATAGT 


GATCTCCTTT 


AAACTGTGAA 


420 


TCTTCCGTGA 


TATTTTATTT 


TACCTGAAAA 


AATAGCACAC 


GCTCACAATT 


CTTCCTAATA 


480 


GCCTCAGCTT 


CAATATTTCT 


TACTTTCCAA 


TAGACTCCCT 


GCGAAACAAA 


ATATGGTATA 


540 


GTAGTTCTAT 


GAATGATGAA 


GCAAGTAAAC 


AACTAACTGA 


TGCACGATTT 


AAGCGTCTTG 


600 


TTGGTGTTCA 


GCGCACGACT 


TTTGAAGAGA 


TATTAGCTGT 


ATTAAAAACA 


GCTTATCAAC 


660 


TTAAACACGC 


AAAAGGTGGA 


CGAAAACCTA 


AATTAAGCCT 


AGAAGACCTT 


CTTATGGCCA 


720 


CTCTTCAATA 


TGTGCGAGAA 


TATCGAACTT 


ATGAACAAAT 


TGCGGCTGAT 


TTTGGTATCC 


780 


ACGAAAGCAA 


CTTAATCCGT 


CGGAGCCAAT 


GGGTTGAAGT 


AACTCTTGTT 


CAAAGTGGTG 


840 


TTACGATTTC 


AAGAACTCCT 


CTCAGTTCTG 


AGGACACGGT 


AATGATTGAT 


GCGACGGAAG 


900 


TACAAATCAA 


TCGCCCTAAA 


AAAAGAATTA 


GCGAATCATT 


CTGGTAAAAA 


GAAATTTCAC 


960 


GCTATGAAGG 


CTCAAGCGAT 


TGTCACAAGT 


CAAGGGAGAA 


TTGTTTCTTT 


GGATATCGCT 


1020 


GTGAACTATA 


GTCATGATAT 


GAAGTTGTTC 


AAAATGAGTT 


GCAGAAATAT 


CGGACAAGCT 


1080 


GGAAAAATCT 


TGGCTGATAG 


TGGTTATCAA 


GGGCCCATGA 


AGATATATCC 


TCAAGCACAA 


1140 


ACTCCACGTA 


AATCCAGCAA 


ACTCAAGCCG 


CTAATAGCTG 


AAGATAAAGC 


TTATAACCAT 


1200 


GCGCTATCCA 


AGGAGAGAAG 


CAAGGTTGAG 


AACATCTTTG 


CCAAAGTAAA 


AACGTTTAAA 


1260 


ATGTTTTCAA 


CAACCTATCG 


AAATCATCGT 


AAACGCTTCG 


GATTACGAAT 


GAATTTGATT 


1320 


GCTGGCATTA 


TCAATTATGA 


ACTAGGATTC 


TAGTTTTGCA 


GGAAGTCTAT 


TATTTTCCTT 


1380 
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ATTGTCTGTA AGTCTACTGA CCTTGTTGTT TATCCCAGTC ATGGTTTCTA GTTCGGGCTC 
AGAGTTTCAA AGTGGATGGC AAGAGCATCA ATTGATTGCT GAGAAGGTTA GTAAAACACT 
TGACAAGACA TTTGATAAGG ATGTCAGAAA AATTCCGACC AGTCCAGTTT TATCAAAAAT 
TTGTAGATGA GATGGGAAGG ATTTACTC 



1440 
1500 
1560 
1588 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5; 



CTGTTTTTAA 
TAAATCGTGC 
TTTTGTTTCG 
AATAAGATTG 
GCGATTGGTG 
ACTGCATCAC 
ATTTGGTGAC 
TCCTCCGATT 
AAGTTTTTCT 
GTCCATCAAC 
GAGCAAGGAT 
TTCTAATCGA 



TACAGCTAAC 
ATCAGTTAGT 
CAGGGAGTCT 
GCCCAGCTTC 
TAAAGATATC 
AGTGAACATC 
TTGAGTTAAC 
TTATTTTTTA 
GGTTCGGAAA 
TGAGCCAGAA 
ACTTCTACTT 
ACAACCACCA 



ATCTCTTCAA 
TGTTTACTTG 
AATATTGTCA 
GATAACTGGG 
GTAACCTTTC 
ATAACCACGG 
ACCTGCACCG 
ATAGACAAGA 
TTTTTGATAG 
TATTTGTTTG 
CCTTATCAGG 
CTTTTCTCAG 



AAGTGGTACG 
CTTCATCATT 
AATACTGGAG 
ATACCTGGTT 
ATAAGGTCTT 
TTTGAAAGTT 
CAGGCAGCAA 
TTAAGCGGTT 
GTCTTCAAGA 
ACTTGAACTT 
AGCTATCATA 



CTGAACACCA 
CATAGAACTA 
CGCTCATTGC 
CAAAACCAAG 
CGTTTACATC 
CTTCTTCTAG 
GAATTTTAAT 
GCTTCAGCAA 
TGTCCATTTC 
GAGTTATTGA 
TTGTGAAAAG 



ACAAGACGCT 
CTATACCATA 
TGGTATACGG 
ATCTGTTGCA 
TTTCACCATG 
AGCACTTTTA 
CATTTAGATT 
TGTAAGCATA 
CTGTGAAAAA 
TGATAAAGAA 
TTACTGATTT 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
690 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ACTGCTACCT TTAAGAAGAT AGTAGACGTA TATACTTTTT TAAGAAAATC AAAAAGATAC 
TATAAAAAAT CTATTTGTTT ATTGAATTTA AGACTTTGGT AACAAATTGA AAATAAAAAG 



60 
120 
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GAGGTATTCA TCATGAATAC AAAAATGATG TCACAATTTT CTGTTATGGA TAATGAAATG 
CTTGCTTGCG TTGAAGGTGG AGATATTGAT TGGGGAAGAG AAATTAGTTG TGCAGCAGGG 
GTTGCATATG GCGCAATTGA TGGGTGTGCA ACAACGGTTT GATATTTCTA TTGGGACCAT 
TTGCTATAGG AATAGGTGTA ACTGGTGCTG CAGGTGGAG 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CTGAAGACCT CGTGCTACTC CTCCTCCAAC ACCTGCTTTG GCAAAATCTC CCCAATTGCA 60 

TCCGCCACCT TCAACTCAAG CAAGCATCTC AGTATCCATA ATTTCAAATT GTGACATCTT 120 

TTTTGTATTC ATAACGAATA CCTCTTTTTT ATTTTTAATA TTTGTCTTGT TACAAACTTG 180 

ACAAGTTTAG TATAACAGTA TCTATTAATT TTTTTCATCC AAATCTTGAA TTGGCATCGA 240 

AACGTCTTGA ATTAGCTTTT TTGTTTCAAA ATCATCTCTA TTTTTAAAAA AGATGTTTTC 300 

TAATCACTTT TTTACTATTT AGACTTCCTG CAAAACTAGA ATCCTAGTTC ATGATTGATA 3 60 

ATACCAGCAA TCAAATTCAT TCGTAATCCG AAGCGTTTAC GATGATTTCG ATAGGTTGTT 420 

GAAAACATTT TAAACGTTTT TACTTTGGCA AAGATATTCT CAACCTTGCT TCTCTCCTTA 480 

GATAGCGCAT GATTACAGGC TTTATCTTCA ACTGTTAGCG GCTTGAGTTT GCTGGATTTA 540 

CGTGGAAGTT TGTGCTTGAG GATATATCTT CATGAGCCCT TGATAACCAC TGTCAGCCAA 600 

GATTTTACCA GCTTGTCCGA TATTTCTCAT TTCTAAAAAC CATCTACTTC CGTTGACTTG 660 

GAACTCCGAT TGCTATTTTC CTTGAATGAT TTTAACCACA TCTCCTACAC TTTGGAGTTG 720 

GTCAATTTCC TCATCGCTGA TTTCGATACT AAATTCATCC TCCAGCGTCA AGATAAACTC 7 80 

CATCAAATCA ACTGAGTCAG CATCCAAGTC GTCTTTCAGA CTCAAGGATT CTGTCACGAC 840 

AAAGTCCTCT CCCTGTCGCT CTTGGATAAT GGTCACAATA CTGTCAAAAA TTTCTTTTTC 900 

TCTCATCTCT TTTATTCTCC TGAAAATTCA CGCGCAGTCT GGGCAACTAC TTCTGTTTCT 960 

AGCATGGTAC GAATCTGGCG AATCGTACTA TAAACAGCCT TGGCATCGCT TGAGCCATGA 1020 

GTCTTGACAA CAGGTGCCTT GACACCAAAC AfcGACCGCTC CACCAACATC TGAATAATTG 1080 

AGCTGTTTTT TCAAACCTCT GAGGCTGTCC TTGAGAAGGA GGGCACCTAG TTTCGCTCGA 1140 

AGACCACCAC CTGTAATAGC TGTCTTGAGC AAGCCCATGA TTCCCATAGC TGTCCCTTCG 1200 

ATGGATTTGA GCACAGCGTT TCCCGTGAAA CCATCTGCCA CAACAACATC TGCAACGCCA 1260 

TTCATCAAAT CACGCGCTTC CACGTTTCCG ATAAAGTTCA AACTTTCATC AGCCGCCAGT 1320 

AATTCATAAG TTTCCTTACG AAGCGGGTCG CCCTTGCTAC TCTCTGTTCC GTTGTTGAGC 1380 

AAACCAACAC GTGGTTGCGC AATGCCACGA ACATTCTTGG CATAGAAAGA ACCTAGGACA 1440 

GCGTATTGAT GGAGGTGCTG GGCTGTATTT TCTGCA.TTAG CACCGAGGTC AAGCATGTCA 1500 

AAACCTTTCC CATCTACAGT CGGCAATGTT GACATAAGTC CAGGACGGTC GATATTCTTG 1560 

ATACGACCCA CGATGAAGAA TCCAGCAGCC AACAAAGCAC CTGTATTCCC AGCCGAAAGG 1620 
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ACAGCGTCTG CTTCACCATC TTTGACAGCC TTGGCTGCCA ATACCATACT GGCATTTTTC 1680 

TTATTCCGAA TAG 1693 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 616 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



,60 
120 
180 
240 



360 
420 
480 
540 
600 
616 



CTAAATCTGC TTGCTTAGTC CACTTGCTTG AGCCAAGGAG TCACATACTC AAAATTTTCA 
TGAAAGTCAT AGGTACCGTC TTCTTTTTTA GCTGAAAAGA AAAGTCCATC GTGGTAGAGC 
AAATCCCCGT GGGCCATAAT TCTGGCAGTT TTTTCCTCTT CCTACTCCTG AGACTTTTGC 
TTAGTCCCCT CTTGAGAAAT AGTATCTCGT TTTTGACTAG TCAAGGGATT CCTTGGAAGC 
TTTCAAACAA CAAGACCAAG CCCATTGATA AACCAACTGC TAGCAAGAGT ATCGCCACAA 300 
ATCCCTTATT GCTCCACTTG CGATAACTCC TAAAAAGTTT ACCAAGCCCT TCATAAAACG 
AAAAGCTAAA CCACCCTGAT TTCGATTTTG TCTTCTTTGT ATCTTCGTTC TCCCTACTTT 
CTTATGCAAG CCTTTTCTTT TTATTATATC ACAGATAAGT ATTTCTTTCA CAATTGAATT 
GAACTTCCCA TCTATTTTCT ATAAATCCTA AATGCCATAA TGCTTTCAAT TCCTGTCATT 
TTGTGATATC ATGTAGAAGA AATGAACTAA TCCACAGTGG CTTATTCCAA GTATACCACT 
TGGGCTTTGG CAGTAG 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1973 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CTAATATAGA ATAATCACCG CCGTTGTGAA AGAACGATTG GATGATAATC CAATCGTTCA 
GGGAAATTGG AAGACCTTGG GTTTCCAATT TAGGCATGAG ACACCTTTGG TGGCTGCTGC 
r G TCCCTCAC AAGCTAAGGT GATTGTTGAA AAAGAGGAAA AAGGAGAAGA AATGAAACCA 
GTAATTTCCA TCATCATGGG CTCAAAATCC GACTGGGCAA CCATGCAAAA AACAGCAGAA 
GTCCTAGACC GCTTCGGTGT AGCCTACGAA AAG AAAGT TG TTTCCGCACA CCGTACACCA 

38 
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GACCTCATGT 


TCAAACATGC 


AGAAGAAGCC 


CGTAGTCGTG 


GCATCAAGAT 


CATCATCGCA 


360 


GGTGCTGGTG 


GCGCAGCGCA 


TTTGCCAGGC 


ATGGTAGCTG 


CCAAAACAAC 


CCTTCCAGTC 


420 


ATTGGTGTGC 


CAGTCAAGTC 


TCGTGCTCTT 


AGTGGAGTGG 


ATTCACTCTA 


TTCTATCGTT 


480 


CAGATGCCGG 


GTGGGGTGCC 


TGTTGCGACC 


ATGGCTATCG 


GTGAACTCTT 


TTTTAGGATA 




TAAAACAGGG 


TTCGGATAAG 


TTTTTTTGCA 


AGGTGGATGA 


TGGCTACATT 


GTAATGTTTT 


c ft n 


CCTTATTCTA 


ACTTAGTCTT 


AAGATAGGTC 


TTAAAACCAG 


GTGAAAAGCG 


AAGGCATGCT 


C C ft 


TTGGCAGCTT 


GTATGAGTAC 


CTACCGCAGA 


TGAAGGGAAC 


CCCGTTTGAC 


CATCCTTCCA 


720 


ACTAAATCAA 


TCTGACCTGA 


CTGATAAATA 


GAAGAATCCA 


GTCCAGCGAA 


AGCTTGTAAT 


780 


TGAGCAGGAT 


TATCAAAGGC 


ATGAATATTT 


CGAATCTCGG 


CTAAAATGAC 


CGCCCCTAAA 


840 


CGATTCTCAA 


TCCCAGTAAC 


CGTCGTGATG 


ACCGAGTTTA 


ACTCAGCCAT 


CAAGTCATTG 


900 


ACACATTTTT 


CCGCCTTGTC 


AATGAGCCTC 


TTGTAATGTT 


TGATGTTTTC 


ATTACACGAG 


960 


ATAAAACGTC 


TATGCGTTAT 


CAAACTCATT 


ACCAATTAAA 


ACAAATGTGG 


TTAGATCCTT 


1020 


TCGGAAATTG 


TCAAGCGATT 


GGAGGAAATG 


AACTAATCCA 


CAGCGGCTTA 


TTCCAAGTAT 


1080 


ACCACTTGGG 


CTTTGGCAGT 


AGCTAACTGC 


GCTAAATATA 


ATATAAGGAG 


GAGTAAAATG 


1140 


AAGACAGTTC 


AATTTTTTTG 


GCATTATTTT 


AAGGTCTACA 


AGTTCTCATT 


TGTAGTTGTC 


1200 


ATCCTGATGA 


TTGTTCTGGC 


GACTTTTGCC 


CAAGCCCTCT 


TTCCAGTCTA 


TTCTGGACAA 


. 1260 


GCGGTGACGC 


AGCTAGCCAA 


TTTAGTTCAA 


GCTTATCAAA 


ATGGGCAATC 


CAGAACTTGT 


1320 


ATGGCAAAGC 


CTATCAGGAA 


TTCATGGTCA 


ATCTTGGCCT 


GCTGGTTTTG 


GGTTCTATTT 


1380 


ATCTCTAGGT 


GTAATATAAA 


CATGTGTCTC 


ATGACGCGCG 


TGATTGCAGA 


ATCGACCAAC 


1440 


GAGATGCGCA 


AAGGTCTCTT 


TGGTAAGCTT 


GCTCAGTTGA 


CGGTTTCTTT 


CTTTGACCGT 


1500 


CGACAAGATG 


GCGATATCCT 


GTCTCATTTT 


ACCAGTGATT 


TGGATAATAT 


CCTCCAAGCC 


1560 


TTTAACGAAA 


GCTTGATTCA 


GGTCATGAGC 


AATATTGTTT 


TATACATTGG 


TCTGATTCTT 


1620 


GTCATGTTTT 


CGAGAAATGT 


GACGCTGGCT 


CTCATCACCA 


TTGCCAGCAC 


CCCATTGGCT 


1680 


TTCCTTATGC 


TGATTTTCAT 


CGTGAAAATG 


GCACGTAAAT 


ACACCAACCT 


CCAGCAGAAA 


1740 


GAGGTAGGGA 


AGCTCAACGC 


CTATATGGAT 


GAGAGCATCT 


CAGGCCAAAA 


AGCCGTGATT 


1800 


GTGCTAGGAA 


TTCAAGAGGA 


TATGATGGCA 


GGATTTCTTG 


AACAAAATGA 


GCGCGTGCGC 


1860 


AAGGCAACCT 


TTAAAGGAAG 


AATGTTCTCA 


GGAATTCTTT 


TCCCTGTCAT 


GAATGGGATG 


1920 


AGCCTGATTA 


ATACAGCCAT 


CGTCATCTTT 


GCTGGTTCGG 


CTGTACTTTT 


GAA 


1973 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CTAAACCAAT AGTTGCTGAA GACCATAATC CAGCTGCTGT TGTTAAACCT TTAACTTTTT 60 
GATTAGTTAC CATGATTGTT CCTGCCCCTA GGAAACCCAA GCCACTAATT ACTTGGGCTC 120 
CCATTCGACT AGGATCACCG CTACCATAAC GACTAGTGAT GAACTGATTT GTCATCATTA 180 
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CAACACAAGT TCCCAAACAA ACTAGTAAGT 
CTCCTCGCTC ATAGCCAACA ATGCCACCGA 
TTTCCCAAAT ACTCAGTTCG TATGAAAGAT 
AATAGGCTTG ATAAATAAAG AGCTGCACTA 
AACATCATTG CCTGTGCATT TAAAGTTGCT 
CCCAATGGTT GGAAAAGTGG ATGGTACAAG 
GAATTAAAGT TTAACACAAC TACTGATAAT 
CGCACCATAG TGTAGAAACT AGATGCACCC 
TCTATACTGA AAAATACAGC CCGAATCATT 
TATGCAATAA GTAAAATAAT TACTGTTCCT 
GGTATATTAT AAGTGAACAT TAATCCTAGT 
ATCAGTGCAC CATACTCAAA GAATTTATCG 
GAAATTACTA TTGCTATAAT TGTAGCAATT 
ACTAGGTAAG GACGAAATGA TTGAGCATCT 
AATTTAGATA GATCTAAAGT TCCCGTCTTG 
ATTATCAAAA CTATTGGTAA CATATAGATT 
AGAATTATTC CATAATGGAG AAGAAAATTA 
AAACAGAAT 
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AGGTTCTAAT CCCTGCTGCT TGGTTTTTGA 240 

AAAGAATCGC TAAAAAGCAC CTAAGAAGTA 300 

TCATATTATC TCTTACCTCG TTTACCTTGG 360 

GACATAATCA TTAAAATTAC AGAATAAACA 420 

GTTTCATCAG TAGACTGTTT AATAACGATT 480 

AATACAGATA AGTCATAGTC AGATAATAAA 540 

ACGACTGGTA AAATATACGG TATAATAACT 600 

ATACTACGTG CAGCTTCTTC CATATCATTA 660 

CTATAAGAAA ATGGTAATTT TTGAATAGTA 720 

ACTAAAACTA AATTAAATAG TATTAAATGA 780 

GCAATCAATG TACCTGGCAA TATCCATGGT 840 

AACTTACTCT TGTTTTTATG TACAATACGT 900 

ACCGCAGCTA AAATTGCATA AACAACGCTG 960 

GTAAATAAAT TAGCATAGTT CGCTAACGTA 1020 

ATCGTCAACG AGTCTGTAAA CGAGTATAGA 1080 

GCAAACAATA CATACGCAGC AATGTGAGCA 1140 

TCTGCTTCCT TAATACTAGC CTGGTTTTCG 1200 

1209 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 



CTAAGTCCTT 


TTAGTTTTAT 


CTTAATTCTC 


TTATTGTTGT 


AATAATCAAT 


ATAGTCTATA 


60 


ATGGCTTGTT 


CCAATTGCTT 


AAGCGACTGA 


AACGACTTCT 


CATAACCGTA 


AAACATTTCC 


120 


GATTTCAGAA 


TCCCAAAGAA 


AGATTCCATC 


ATACCGTTGT 


CTTGGCTGTT 


TCCCTTGCGT 


180 


GACATAGATG 


CTTGAATTCC 


CTTACTCTTA 


GGAACCGATG 


ATAAGAATCG 


TGTTGGTATT 


240 


GCCAGCCTTG 


GTCACTATGG 


AGAATCGTAT 


TCTCGTAGTG 


CTTCTCTGTG 


AATGCCTGTT 


300 


CCAACATTGT 


TTGTACTTGT 


TCTAAGTTGG 


GTGAAGTTGA 


AAGATTATAG 


GCGATAATTT 


360 


CGCTATTAAA 


GCCATCTAAA 


ACTGGTGATA 


AGTAAAGCTT 


TTGAGTACTT 


GCTGGAATGG 


420 


CAAATTCTGT 


CACATCTGTG 


TAGCACTTTT 


CCATTGTTTT 


AGAGCCTTCA 


AATTGGCCTT 


480 


GAATGAGATT 


CTCTGCCTTC 


TTACCAACGT 


CTCCTTTATG 


AGAAGATTTC 


GTTTCTGTCG 


540 


CATTTTAGCT 


TGTAAATTGA 


GTACTTTC AT 


CAAGCCTTGA 


ACTCTTTTAT 


GATTTACCAG 


600 


ATAAGCACGA 


TTCCTTAGTT 


CTAAATGAAT 


ACAGCGATAA 


GCATAATTTC 


CCTTGTGTTC 


660 


GATAAAAATG 


GATTGAATTT 


CAGCTTTAAG 


CTCTTGGTCC 


TTATCTGGTT 


TGTNTAG 


717 
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(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 851 base pairs 
(B) TYPE: nucleic acid 
<C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CTAAATCACC AGATTTAACC AAGTCGTAGA TACGGCCTGG TGTTCCAATA ACAATATGAG 60 

GCTGATTGCT TGCCAATTTC TCAATCTGGC GAGCCTTATC CGTACCACCC ACATAATTAA 120 

CCACACGAAC TTCGACATCT GAGTGAGCTG AAATCTGACG CGCTACTTGG TAAATTTGAG 180 

TAGCCAACTC ACGACTCGGT GCAGTAATCA CTGCTTGTAC ACTATCGCTA GCTTCATCTA - 240 

ATTGCTGGAA AATCGGTAAC AAGAAAGTAT GAGTCTTACC TGAACCTGTT TTTGATTCTC 300 

CTACTAGGTC ACGACCTGCC AAAACAATAG GAATCAACTT GTCTTGCACC TCTGTTGGAG 360 

TTGTAAATTT TAACTCCTCC AAGGCTTCTC TAATATAGTT TTTAAATTGA AATTTCGTAA 420 

ATGACATAAC ATCCTCGATT CTATCTATCT TATCAATTAT ACCATATTTT ATTCCATTAC 480 

AGTAGTCTCA CTTATTTAGG CTATTTCCAG TAGCTTCTCT AGTAAGAAAA GGCTGGAATT 540 

TTATAGTTCC AACCTCTTTT CAGTTATTAT TTCCAGTTTA ACATAGCATT CAAGCCATAG 600 

TGATCACTCA CTTGTGGACT CTTGTTACCA TCAAATACGA CATGTAAATT TTCCACCGCT 660 

AACTCTTTGG TAGTAAAGAC ATAATCGATT CGAAGGGGTT CAGTGTTCCC TTTCCAGCCA 720 

TCAATTTCAG GCGGAACAGT ATAGCTACCA CTTTTCTCTT GAGCAACTTC AAATGCGTCT 780 

TGTAAGCCTA ATGGACTAGC TAAAATAGCT TGGTAACCTT CCCTGACCTG CTGGGTTGTT 840 

AAAATCTCCA G 851 

£2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



60 



CTGTTGAGAT TGTTACGAAA TAACTGAAAA TATTTAAGGA GAAAATATAT GTTGAAACAC 

TTAAACTTAA AAGGTCACTT ATTGACAGCC ATTTCCTATA TGATTCCAAT TGTTTGTGGT 120 

GCAGGATTCT TAGTTGCCAT TGGTTTAGCA ATGGGGGGTG GTGTTCCTGA CGCTCTTGTA 180 

GCAGGAAAAT TCACTATCTG GGATGCTTTA GCAACTATGG GTGGTAAAGC CCTTGGTCTC 240 
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TTGCCAGTTG TTATTGCTAC AGGTTTGTCT TACTCGATTG CTGGTAAGCC AGGGATTGCA 300 
CCAGGTTTTG TTGTTGGTCT AATTGCCAAT TCTGTTGGTT CAGGGTTTAT CGGTGGTATC 360 
TTGGGAGGTT ATATAGTTGG TTTCTTGGTT CAAGCGATTA TTAAAAAGGT CAAAGTACCA 420 
AACTGGATTA AAGGTTTAAT GCCAACCTTG ATTATTCCTT TTGTACCTCT TTGGTAAGTA 480 
GTTTGATTAT GATTTATATT ATTGGGGCGC CTATCGCAGC CTTTACCAAC TGGTTGACGA 



G 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1324 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



540 
541 



CTGGAATGAG TGTATAAGCC CAGCCCAAGT TTTGCATCCG TTCAAAGTTC CAAGACCCTT 
GTAAGAAGGT TGAACGCCAC CAAACTTTTT TACGATCTGA TTTAGTTAAT TGAAGTTTTT 
CAGTCATGAT GTTTTCAGTC CTTTCTTATC TTAGTAGTCT TCTAGGATAT CGCCGATTGG 
GTCGTTAGAA GTTGCGGCTC CTCCGCCACC ATTTCCACCA GTTTTAGAAA GGTGAAGGTA 
GATAAGAGCG ATAGCAACAC CGATAGCACC GAATCCGATT AGAGTAATAT CTGACACAGC 
AGCGAGAACG AAACCAAGAG CGAAGAATGG C CAT ACTTC A CGAGTTGCCA TCATGTTGAT 
AACCATGGCG TAACCAACGG CAACGACCAT ACCACCACCG ATAGCCATAC CATCTTTGAG 
CCAGTCTGGC ATGGCACTAA GGATACTTTG TACAGTTTCA GTTGGTACCA TAAGGAGAAG 
AGCTGCAGGA AGCGCGATAC GAAGTCCTTG GAAAAGTAGC GCAATGAAAT GCGCACGCTC 
CACAGCGCCG AAGTCACCTT TTTTAGCGGC AGCATCTGCA GTATGAACCA AACCAACTGA 
AATTGTACGA ACAATCATTG TCAAGAAAAG TCCAGCTACG GCAAGAGGGA TAGCAACCGC 
TTGGGCAACA CCGATACCAG TCTTGGTAAA GTCACCACCA AGAACCATGA TAATGGCAGC 
AGCGACAGAA GCAAGTGCAG CATCAGGAGC GATAGCAGCA CCGATATTTG ACCAACCAAG 
GGCAATCATT TGAAGCGATC CACCGAGGAT AATCCCTGCT TCCAAGTGGA CCTGTTACAA 
GCCCAATAAG GGTACAGGCT ACAAGTGGTT GGTGAAATTG GAACTGGTCG AGGATGCCTT 
CAAGACCTGC AAAGGAAGGC TACAACGACT ACTAAAACCA TAGAAATAAT AGACATGTTT 
AAAATCCTTT CATAAATAAT GGCTTATTTG ACATTGGCTT TGTTAATCAA GTCAAACAAA 1020 
TCTTTTTTAG AATCATTTGG TACTTTACGG ACATCAAATT CAACACCCAA GTCACGCATT — 
TTTTCAAATG TAGCAACATC TTCTTTGTCC ATAGACAAAA CGGTATTGAC CAATGTTTTA 
CCTGTTGAGT GAGCCATAGA ACCAACGTTA AGAGTCTTGA TTGGCACGCC GCCTTCGATG 
GCACGAAGGG CATCTTGAGG TGTTTCAAAC AAGATAAGGG CATGTGTTTC TCCAAAACGT 
GGGTCTTTTG AAATATCAAT CAGTTTTTGA ATTGGAACCA CGTTAGCCTT GACATTACCT 



60 
120 
180 
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420 
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540 
600 
660 
720 
780 
840 
900 
960 



GGAG 



1080 
1140 
1200 
1260 
1320 
1324 



(2) INFORMATION 7 OR SEQ ID NO: 15: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



TCATAAGGAA 


GCTGTCGCTC 


GTTCCGCTAA 


GGTATGGACA 


CCACGGTGAA 


CATTGGCATT 


60 


GTCCTGCTCA 


TAGTAACTGT 


TAATAGCTTT 


CAGAACTACT AGTGGTTTTT GTGTCGTCGC 


120 


AGCATTGTCC 


AGATAGACCA 


GAGGTTCATC 


ATTGACAATC 


TGATCTAAAA 


TTGGAAAATC 


180 


CTTGCGAATC 


GCTTCTACAT 


CTAACATAGG 


CTTCCCCTTA GCGTTTTGAC 


AATTTCTCTT 


240 


CGATAGTTGC 


AATCATTTCA 


TCACGAACTT 


CCTTGACTGG 


AATCTCCACG 


ATAACAGATC 


* 300 


CAAGGAAACC 


ACGAACAACC 


AAACGCTCTG 


CAGTTGCCTT 


ATCCAATCCA 


CGACTCATGA 


360 


GGTAATACAT 


GTCTTCTGGA 


TCAACTTGTC 


CGATAGACGC 


TGCGTGTCCT 


GCAGTGACAT 


420 


CATTTTCATC 


AATCAAAAGA 


ATTGGGTTAG 


CATCTGAACG 


CGCTTGGTCT 


GAAAGCATGA 


480 


GAACACGGCT 


CTCTTGTTGC 


GCATCTGCTC 


CCTTAGCACC 


CTTGATGATG 


TGGCCGATAC 


540 


CATTGAAAGT 


CAAAGTTGCT 


TTTTCAAGGA 


TAACCCCATG 


TTGTAGGATA 


TTTCCGATAG 


600 


AGTTGCAGCC 


ATAGTTAGTT 


ACACGAGTAT 


CAATCCCTTG 


TACCTGACGA 


CCACTTGAAA 


660 


GAG 
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(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 649 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



CTAGTTGGAT 


GGCTTCAATA 


AAGGATGATT 


TGGCTGCTCC 


ACTATTGGCA 


ATGAGCTGAA 


60 


AACAGATATA 


TTCCATTTCT 


TCGTCATCTT 


ATTTCTCCTA 


TCCATTCAAG 


TGCTTGTTCC 


120 


AGAACTTTTG 


CTCCATTCAT 


CATTCCGTAA 


TCCCGCATAT 


CAATGGTATC 


TACAGGGATA 


180 


TTTCCTGCAA 


TTTCTTTCAC 


AGCAAGTAAC 


TCATAACGAA 


TTTGTGGCCC 


AATTAGAATG 


240 


ACATCTGCTT 


CATGGATATT 


CTTTTTAGCT 


TCTGTCATTG 


ATTTTGCTTG 


GATAGAAATT 


300 


TCAATCCCAC 


GTTCAGTCGC 


ACTTTGTTGC 


ATTTTTTTAA 


CAAGCATACT 


TGTCGACATT 


360 


CCCGCATTAC 


ATACTAATAA. 


AATTTGTTTC 


ATAATCTTAA 


CCTTCCATTT 


CTTGTTCAAC 


420 


AACTTTGTCA 


TTAACTTTG A 


TAAATGGAAT 


GTATAGAAGA 


ACTCCAAGTG 


CAAAGATGAT 


480 
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GAATTGAACT AGAACTGCTC TCACGTCCCC TGCTGTTGCT AACCATGCAT TTAAGAATAC 540 

TGGTGTAGTC CAAGGAACTT GTATAAATGC AGGACTCATG AATCCTGTAA CTGTTGCTAA 600 

GTAGCTGATT AAAATACCAA GGACTGGAAC TGTGATAAAT GGAATAGTC 649 

(2) INFORMATION FOR SEQ ID NO: 17: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CTTGGTATTA TTTGAAATCA GATGGTTCTT ATGCAAAAAA TGCATGGCAA GGAGCTTATT 60 

ACCTTAAATC AAACGGTAAA ATGGTACAAG GTGAGTGGGT TTATGATTCT TCTTACCAAG 120 

CCATGGTATT ACTTGAAATC AGATGGTTCA TATGCTCGCA ATGCATGGCA AGGAAACTAC 180 

TATTTGAAAT CAGATGGTAA AATGGCTGTC AATGAATGGG TTTATGATGC CACCTATCAA 240 

GCATGGTATT ATTTGACATC AGATGGTTCT TATGCTTACA GTACATGGCA AGGAAATTAC 300 

TATCCTAAAA TCGGATGGTA AAATGGCTGT CAATGAATGG GTTGATGGTG GACGTTATTA 360 

TGTTGGCGCT GACGGAGTTT GGAAGGAAGG TCAAGCAAGT ACAGCTTCTC CTAGTAATGA 420 

TAGCAATAGT GAATATTCCT GCTGCTTTAG GAAAGGCAAA AAGTTATAAT TCGTTATTCC 480 
ACATGTCAAA AAAAACG 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



497 
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180 



CTAGTCTAAA TTTTTTAAAA CAAAGGTCAA AGATAGTCAA TATCAGTAAT CATAACTAAG 
TAAACAAAAA GAGGTAAAGA ATATGAATAA CAACTTTAAT AATTTTAATA ACATGGATGA 
TTTATTTAAC CAATTGATGG GTGGTATGCG AGGATACAGT TCTGAAAATC GCCGTTACTT 
AATTAATGGA CGCGAAGTCA CACCTGAGGA ATTTGCTCAC TATCGTACGA CTGGTCAATT 240 
ACCAGGAAAT GCAGAAACTG ATGTGCAAAT GCCACAACAG GCATCAGGTA TGAAACAAGA 300 
CGGTGTCCTT GCAAAACTAG GTCGAAAC TT GACAGCAGAA GCGCGTGAGG GCAAGTTGGA 

64 
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TCCTGTTATC 


GGACGAAACA 


AGGAAATTCA 


AGAAACATCT 


GAAATCCTCT 


CACGCCGCAC 


420 


CAAGAACAAT 


CCTGTTTTGG 


TCGGAGATGC 


AGGTGTTGGT 


AAGACAGCAG 


TTGTCGAAGG 


480 


TCTAGCGCAA 


GCCATTGTGA 


ACGGAGATGT 


TCCTGCTGCT 


ATCAAGAACA AGGAAATTAT 


540 


TTCTATTGAT 


ATCTCAGGTC 


TTGAGGCTGG 


TACTCAATAC 


CGTGGTAGCT 


TTGAAGAAAA 


600 


TGTCCAAAAC 


TTAGTCAATG 


AAGTGAAAGA 


AGCAGGGAAT 


ATTATCCTCT 


TCTTTGATGA 


660 


AATTCACCAA 


ATTCTTGGTG 


CTGGTAGCAC 


TTGTGGAGAC 


AGTGGTTCTA AAGGGCTTGC 


720 


GGATATTCTC 


AGCCAATCGA 


TCTCTCTCGT 


GGAGAATTGA 


CAGTGATTGG 


GGCAACAACT 


780 


CAAGACGAAT 


ACCGTAACAC 


CATCTTGAAG 


AATGCTGCTC 


TTGCTCGTCG 


TTTCAACGAA 


840 


GTGAAGGTCA 


ATGCTCCTTC 


AGCAGAGAAT 


ACTTTTAAAA 


TTCTTCAAGG 


CATTCGTGAC 


900 


CTCTATCAAC 


AACACCACAA 


TGTCATCTTG 


CCAGACGAAG 


TCTTGAAAGC 


AGCGGTGGAT 


960 


TATTCTGTTC 


AATACATTCC 


TCAACGTAGC 


TTGCCAGATA 


AGGCTATTGA 


CCTTGTCGAT 


1020 


GTAACGGCTG 


CTCACTTGGC 


GGCTCAACAT 


CCAGTAACAG 


ATGTGCATGC 


TGTTGAACGA 


1080 


GAAATCGAAA 


CGGAAAAAGA 


CAAGCAAGAA 


AAAGCAGTTG 


AAGCAGAAGA 


TTTTGAAGCA 


1140 


GCTCTAAACT 


ATAAAACACG 


CATTGCAGAA 


TTGGAAAGGA 


AAATCGAAAA 


CCACACAGAA 


1200 


GATATGAAAG 


TGACTGCAAG 


TGTCAACGAT 


GTGGCTGAAT 


CTGTGGAACG 


AATGACAGGT 


1260 


ATCCCAGTAT 


CGCAAATGGG 


AG 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



CTTGATATAA 


TTCTTGTTTC 


GAAAGATTCT 


CCTTAGGTAT 


ATCTATTCCT 


CCACTAGTAA 


60 


ACGGTAATTC 


CAAAACAGAG 


TTTACTTCGT 


TAAATGTAAG 


CCAATATTTA ACTTTATCTT 


120 


TATACCTTTC 


TAAAACTGTT 


CGAGCAAATT 


TTTCATAAAA 


ATGAATCATT 


CTTCCTATCA 


180 


ATCCATCCAT 


GATATTTTCT 


TGCTAAATAT 


AATGGAGTCT 


CATAGTGTGA 


AAGAGTTACA 


240 


AGTGGTTCTA 


TCCCGTGAGC 


ATGTAGTTCA 


TCAAACAATT 


CATCATAATA 


TTTCAATCCA 


300 


GCTTCGTTAG 


GTTCTTCCTC 


ATCTCCTTTT 


GGAAAAATTC 


TACTCCATGC 


AATAGAAGTA 


360 


CGAAAAACAT 


TAAAGCCCAT 


TTCAGAAAAC 


AAGGATATAT 


CTTCCTTATA 


TTTATGATAA 


420 


AAATCAATAC 


CTATCAATTT 


TAAGTTATCT 


TCTGTAGGAT 


TTTCTGTTGC 


TTCTCCTAAT 


480 


CCACCTTTGG 


GTAACACATC 


CTGAACTGAT 


AAGCCCTTAC 


CATCTTCATT 


ATATGCTCCC 


540 


TCTACTTGAT 


TAGCTGCAAC 


AGCTCCACCC 


CAAAGAAAAT 


CATCTGGAAA 


AATGGTCATA 


600 


ACTTTCCTCC 


ATTATAATAT 


TACCAGTAAT 


TCCTTAGAAA 


TGCTCGATTG 


TCTGATTATT 


660 


AGGTAATATT 


AATACATCTA 


GAAAATCATT 


GGTATTCGTT 


ACAATTACTG 


GTGTAACTGT 


720 


TTCGTAGCCT 


TTAGTCTTGA 


TTAAATTCAA 


GTCCATTTCA 


AAAATCAACT 


GATTTTTGAA 


780 


AACTCTGTCT 


CCTTCTTCTA 


CATGACTAAT 


AAAACCTTGA 


CCTTTTAG 




828 
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(2) INFORMATION FOR SEQ ID NO: 20: 



PCT/US97/05306 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

AGCTATGGTC AGACTCAGAT TGATGGCGTT GCTTATGCCA AGTACGATAT CTTCCGTTTA 60 
AAGAACGGGA AAATTGTGGA GCATTGGGAT AATAAGGAAG TCATGCCTAA GGTAGAAGAC 120 
TTGACCAATC GAGGGAAGTT TTAAATTGAG GACAAAGAAT GATTGAATAC AAAAATGTAG 180 
CACTGCGCTA CACAGAAAAG GATGTCTTGA GAGATGTCAA CTTACAGATT GAGGATGGGG 240 
AATTTATGGT TTTAGTAGGG CCTTCTGGGT CAGGTAAGAC GACCATGCTC AAGATGATTA 300 
ACCGTCTTTT GGAACCAACT GATGGAAATA TTTATATGGA TGGGAAGCGC ATCAAAGACT 
ATGATGAGCG TGAACTTCGT CTTTCTACTG GTTATGTTTT ACAGGCTATT GCTCTTTTTC 
CAAATCTAAC AGTTGCGGAA AATATTGCTC TCATTCCTGA AATGAAGGGG TGGAGCAAGG 
AAGAAATTAC GAAGAAAACA GAAGAACTTT TGGCTAAGGT TGGTTTACCA GTAGCCGAGT 
ATGGGCATCG CTTACCTAGT GAATTATCTG GTGGAGAACA GCAACGGGTC GGTATTGTCC 
GAGCTATGAT TGGTCAGCCC AAGATTTTCC TCATGGATGA ACCCTTTTCG GCCTTGGATG 
CTATTTCGAG AAAACAGTTG CAGGTTCTGA CAAAAGAATT GCATAAAGAG TTTGGGATGA 
CAACGATTTT TGTAACCCAT GATACGGATG AAGCCTTGAA GTTGGCGGAC CGTATTGCTG 
TCTTGCAGGA TGGAGAAATT CGCCAGGTAG CGAATCCCGA GACAATTTTA AAAGTGCCTG 
CAACAGACTT TGTAGCAGAC TTGTTTGGAG GTAGTGTTCA TGACTAATTT AATTGCAACT 
TTTCAGGATC GTTTTAGTGA TTGGTTGACA GCTACAATGA CATTGGTCGG TTCCTTGAGC 
AAGAGATAGA TTAGCCAGAC AGTCATGCCC AAAATCCCTC CAGGTAAGAC CATAGACCGT 
TGCACATTAA GTACGATTAA AAAAGTGATA ATGGCAAGAA AACTTGCTAC TGCTTGTAAT 
AAAAAGGTTG TTAGTGTCAT ATTAGTTCAT CAATACCAAG GCGACAGAAG TTCCTGCCCC 
TAAAGCGAGG GTAATGAGCA GGGATTCAAA CATCTTACTC ATACCAGAGT TTATGTGGTT 
GGTCATAATA TCACGGACCG CATTGGTCAA GGCAATACCT GGTACAAACG GCATGACCGC 
ACCAGCTATA ATCAAATCCT GCCCGTTTGA ATGGAAAAAC CCTGTGTTAG CCGAGCCCCA 
AAACTGGGGC CAATTTATCC CCCAAAGACA AAAGCTCCAT CAAAGGCTGT CACAAAGGGA 
ATTCGGATAA ATTTTCCACA TAGAAGGAAA AGGCAAAACC AAATAAGGTC GCCACTCCTG 
CCCCAAGTGC TCGTAAATAT TCCGCT 

(2) INFORMATION FOR SEQ ID NO: 21: 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1466 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1257 base pair3 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



AAGGTTCTTC GTAAATGTAC ACCATTCGGA TGCTCAAAAT GGGTTCTGAA GCTGCTGCTA 60 

AATCTGCACA AGAACACGGT CTTAAATCAG TTGAAGTTAC TGTAAAAGGT CCAGGTTCTG 120 

GTCGTGAGTC AGCTATTTCG TGCGCTTGCT GCCGCTGGTC TTGAAGTAAC AGCAATTCGT 180 

GATGGGACTC CAGTGCCAAC ACAATGGTGC TTCGTCCTCC AAAACGTCGC CGTGTATAAT 240 

CATCGCATTA CACTGCTTTT CGTTTAAGAG GGAGTAACTA AATGATCGAG TTTGAAAAAC 300 

CAAATATAAC AAAAATTGAT GAAAATAAAG ATTATGGCAA GTTAGTAATC GAACCACTTG 360 

AACGTGGCTA CGGTACAGCT CTTGGTAACT CTCTTCGTCG TGTACTTCTA GCTTCTCTAC 420 

CAGGAGCAGC TGTGACATCT ATCAACATTG ATGGTGTGTT ACATGAGTTT GACACAGTTC 480 

CAGGTGTTCG TGAAGACGTG ATGCAAATCA TTCTGAACAT TAAAGGAATT GCAGTGAAAT 540 

CGTACGTTGA AGACGAAAAA ATCATCGAAC TGGATGTTGA AGGTCCTGCT GAAGTAACAG '600 

CTGGTGACAT TTTGACAGAT AGCGATATTG AAATTGTAAA TCCAGATCAT TATCTCTTTA 660 

CAATTGGTGA AGGTTCTTCT CTAAAAGCGA CTATGACTGT TAACAGTGGT CGTGGATATG 720 

TACCTGCTGA TGAAAATAAA AAGGATAATG CACCAGTTGG AACACTTGCT GTAGATTCTA 780 

TTTATACACC AGTTACAAAA GTCAACTATC AAGTGGAACC TGCTCGTGTA GGTAGCAATG 840 

ATGGTTTCGA CAAATTAACC CTTGAAATCT TGACAAATGG AACAATTATT CCAGAAGATG 900 

CTTTAGGGCT TTCAGCACGT ATTTTGACAG AACATCTTGA TTTGTTTACA AATCTTACTG 960 

AGATTGCTAA GTCAACTGAA GTGATGAAAG AAGCTGATAC TGAATCTGAC GACCGTATTT 1020 

TAGATCGTAC GATTGAGGAA CTGGACTTGT CTGTGCGTTC ATACAACTGT TTAAAACGTG 1080 

CCGGTATCAA TACTGTGCAT GATTTGACAG AAAAATCTGA AGCAGAGATG ATGAAAGTAC 1140 

GAAATCTTGG ACGCAAGAGT TTGGAAGAAG TGAAACTCAA ACTCATTGAT TTGGGTCTTG 1200 

GATTAAAAGA TAAATAAAGG AGGAATACAT GGCTTACCGT AAACTAGGAC GCACTAG 1257 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



CTATTGAAAC AAGAAAAATA GAGAATC^AA GAAAGAGAAC TTATGAATAT TCAAGAAGAA 60 

ATTAAGAAAC GTCGTACCTT TGCCATTATC TCCCACCCGG ACGCGGGGAA AACAACCATC 120 

ACTGAGCAGT TACTCTAACT TTGGGGGTGA GATTCGTGAG GCTGGTACGG TAAAAGGGAA 180 

GAAAACAGGG ACTTTTGCTA AATC'IVSACTG GATGGATATC GAGAAGCAAC GTGGGATTTC 240 
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TGTTACTTCA TCTGTTATGC AATTTGACTA 
GCCAGGGCAC GAGGACTTCT CAGAAGATAC 
GGTCATGGTC GTGGACTCTG CCAAGGGGAT 
TGTGAAACAT CGTGGCATTC CAGTCTTTAC 
TGAGCCTTTG GATCTCTTGC AAGAATTGGA 
GAACTGGCCT ATCGGGATGG GGAAAGCCTT 
TTTAGAGCTT TACAAAGGGG ATGAGCGTTT 
TTGGTAGCAA TCCTTTCTAC GAGCAAGTCA 
GGAATGAGTT TTCAGAGGAA GCTATTCTGG 
CAGCCCTGAC AAACTTTGGT GTGCAGACCT 
AACCACATGG TCACAAGAAA ACAGACGGTG 
CAGGCTTTGT CTTTAAAATC CAAGCCAACA 
TTGTCCGTAT CGTATCAGGC GAATTTGAGC 
GTAAGGGTGC CAAACTATCT AATGTTACCC 
CCAATGCCGT AGCAGGTGAT ATTATCGGGG 
ATACCTTGAC GGTTGGAAAA AACAAGTTTG 
AAATTTTCAT GAAAGTTTCT GCTAAGAATG 
TTGAGCAATT GGTGCAAGAA GGAGCCGTTC 
ACATGCTGGG AG 



PCT/US97/05306 

CGACGGCAAG CGCGTGAATA TCTTAGACAC 300 

CTATCGTACC TTGATGGCGG TGGATGCTGC 360 

CGAGGCTCAA ACAAAGAAAT TGTTTGAGGT 420 

CTTTATGAAC AAGTTGGATC GTGACGGTCG 480 

AGAAATCTTG GGCATTGCTA GCTACCCTAT 540 

TGAGGGCTTG TATGACCTCT ATAACCAACG 600 

TGCTAGCCCT AGAAGATGGA GACAAACTTT 660 

AGGATGACAT TGAGCTTTTA AATGAAGCTG 720 

CTGGAGAATT GACGCCTGTC TTTTTCGGTT 780 

TCCTTGAAAT CTTCCTCAAG TTTGCTCCAG 840 

AAATTGTGGA TCCTTATGAC AAGGATTTCT 900 

TGGATCCTCG TCACCGTGAC CGTATTGCCT 960 

GTGGCATGAG TGTCAATCTC CCTCGTACTG 1020 

AGTTTATGGC GGAGAGTCGT GAGAATGTGA 1080 

TTTACGATAC CGGTACTTAT CAGGTTGGGG 1140 

AATTTGAACC ACTGCCAACC TTTACTCCTG 1200 

TTATGAAGCA AAAATCCTTC CACAAGGGGA 1260 

AGCTTTATAA GAATTACCAA ACAGGTGAGT 1320 

1332 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 932 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



60 
120 
180 
240 
300 
360 



TTGGACGCTG GAACAAATCC AAGCTGACCT GTTCCAAGAC CAGACTTGGT ATGCTCTGGC 
TTATGATGGG GCAGAAGTGA TTGGCTTTCT AGCTGTTCAG GAGACTCTTC TTTGAAGCAG 
AAGTCCTGCA AATCGCTGTC AAAGGAGCCT ATCAGGGTAA GGGCATTGCG TCAGCCTTGT 
TTGCTCAATT GCCGACAGAC AAGGAAATTT TCCTCGAAGT CAGACAGTCA AATCAACGAG 
CGCAAGCATT TTACAAGAAA GAAAAGATGG CAGTTATCGC TGAGCGAAAG GCCTACTACC 
ATGACCCAGT CGAGGACGCC ATTATCATC-A AGAGAGAAAT AGATGAAGGA TAGATATATT 
TTAGCATTTG AGACATCCTG TGATGAGACC AGTGTCGCCG TCTTGAAAAA CGACGATGAG 420 
CTCTTGTCCA ATGTCATTGC TAOTCAAATT GAGAGTCACA AACGTTTTGG TGGCGTAGTG 480 
CCCGAAGTAG CCAGTCGTCA CCATCTCGAG GTCATTACAG CCTGTATCGA GGAGGCATTG 540 
GCAGAAGCAG GC3ATTACCGA AGAGGACGTG ACAGCTGTTG CGCTTACCTA CGGACCAGGC 600 
TTCGTCGGAG CCTTGCTAGT TGGTTTGTCA GCCGCCAAGG CCTTTGCTTG GGCTCACGGA 660 
CTTCCACTGA TTCCTGTTAA TCACATGGCT GGGCACCTCA TGGCAGCTCA GAGTGTGGAG 720 
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CCTTTGGAGT TTCCCTTGCT AGCCCTTTTA GTCAGTGGTG GGCACACAGA GTTGGTCTAT 
GTTTCTGAGG CTGGCGATTA CAAGATTGTT GGAGAGACAC GAGACGATGC AGTTGGGGAG 
GCTTATGACA AGGTCGGTCG TGTCATGGGC TTGACCTATC CTGCAGGTCG TGAGATTGAC 
GAGCTGGCTC NTCTNGGGCA GGANATTTAT GA 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



CTAAACTTTC 


GCTCATAGGC 


ATACAAATTA 


ATCCTTTGGC 


ATAAGTAGCC 


ATAAAATTAA 


. 60 


CATTTTCTGT 


TGTAGCTGCT 


TGTGCAGAAC 


AAATTAAGTC 


TCCTTCATTT 


TCTCTATCCT 


120 


TGTCGTCTAT 


AACAAGAACA 


AGTCGTCCCT 


TCTGCAATGC 


TTCTAATGCT 


TCTTGTATTT 


180 


TTCGATATTC 


CATTGACTGA 


TTATCCTTTC 


TGCTAAAATC 


CATTTTGATA 


TAATAGTTCC 


240 


TTCGATATTT 


CTGATTTTGG 


AGAGTTATCC 


ATCAGTTTTT 


GCACATATTT 


ACCTAAGATA 


300 


TCATTTTCAA 


GATTTACTGT 


ACTCCCGACT 


TGTTTACTCT 


TAAGAATGGT 


TTGTTTCAAG 


360 


GTATGAGGGA 


TAACAGATAC 


TGAAAAGTTT 


ACTTTGGAGA 


CTTTAGCGAC 


AGTCAGACTA 


420 


ATGCCGTCAA 


TTGTAATAGA 


TCCTTTTTCA 


ACTATTAAAT 


CTAAAATTTC 


TTTTTGTGTG 


480 


TTGATTTGAT 


ACCATACAGC 


ATTATCATCT 


TTTTTTATTG 


ACGAGATTTT 


TCCTGTACCA 


540 


TCAATGTGTC 


CTGTAACGAC 


GTGACCCCCA 


AGTCGACCGT 


TGACAGATAA 


GGCTCTTTCT 


600 


AGATTCACCT 


CACTTCCATG 


TTTTAATAGA 


GTAAGAGCTG 


TTCGACTCCA 


TGTTTCATTC 


660 


ATTACATCAA 


CTGTAAAGGA 


TTGATGATTG 


AAATGAGTAA 


CTGTAAGACA 


GATACCATTT 


720 


ACTGCTATAC 


TATCGCCTAA 


ATGGATATCC 


GTTAATATTT 


TTGAGGCTTT 


AATTGATAGT 


780 


TTACAATTAC 


GAGAGTCTTT 


CTGTATTCCT 


TCAACTTTTC 


CGATTTCTTC 


AATTATTCCT 


840 


GTGAACATGG 


ATAAATCACT 


TCACTTTCTA 


TGAGATAGTC 


NTTTCCTNTT 


TGAGAAAAAG 


900 


CATAAGGTTT 


CAATCTAATA 


GCGTCATTTG 


GCAAAGAAAT 


GCCTTCACCT 


CCGACAGGAA 


960 


ACTTGGCACT 


GCCTCCAAAA 


ATTTTTGGTG 


CAATATATAT 


TTTCAGCTCA 


TCAACAATTT 


1020 


GTTGTTCCAA 


AGCACTCCAA 


TTCATTAGAC 


TGCCCCCCTC 


TACAACTAGG 


CTATCAATCT 


1080 


GCATGTTTCC 


TTAGATGTTN 


GCATTAAACT 


CNGATAAGTC 


TATATGATTG 


CCTTTTTTCT 


1140 


TTATGGAAAG 


TATTCCCC 










1158 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



WO 97/37026 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 25: 

TAAAGGCTGT TTGGATAATG TAACATAAAG AAATCTTGTT TAGCAGGGAT TGAATAGAGT 60 

TCCTGAAGTG ATGACTTCTT TATTAGACAT AATGTCTGGA CCCAAGATTA AAAGAATGGC 120 

CCCGAAGAAT ACGAGCATCA AAGGTAGCAG ATGCAACAAC TGTATCGTGG AAGATTGAGA 180 

GGAACTTAGG TAATTTAAGA TTGTCTAAAC TTTCTTCTTT CTTACCAAAG CGTCCTGCTA 240 

CTTTATCTAC AAACCAGATT GCAAATTGCT GTTGGTGACC AATCGCAAAT CCGCCACCAC 300 

CAGTCAAGCG TTGAGTTGCC TCAACAGTCA TATTTGAACT AACTGCCCAG TAAAGTCCAC 3 60 

AGATGATACC AATCGCTGCT GTACCGTAAG CATTGCGCAA TTGTGGTACT AAGAATAGAA 420 

CCATAAGAGA TACTGTTGCA GCTAGCCATG CTGATCAATT CCCTCAACCA GTCCTATATC 480 

TCAGCCAAGA GAATCGAGGA GGTCTTTGCG GAATCTCCCG AAAACATCCA TTCAGAATTA 540 

GAACAAAAGC AAGTTACCAG TGGTCGGGTT TTACAAGTCC AAGAATTGAC TTTTACCTAT 60 0 

CCTGATGCGG CCCAGCCTTC TCTGAGAGAC ATTTCCTTTG ATATGACTCA AGGACAAATC 660 

CTTGGTATCA TTGGGGGGAC TGGTTCTGGT AAATCAAGCT TGGTGCAACT CTTACTTGGA 720 

CTTTATCCAG TAGACAAGGG GAACATTGAC CTTTATCAAA ATGGACGTAG TCCTCTTAAT 780 

TTGGAGCAGT GGCGGTCTTG GATTGCCTAT GTACCTCAAA AGGTCAAACT CTTTAAGGGA 840 

ACTATTCGTT CCAACTTGAC TTTAGGTTTA AATCAAGAAG TATCTGACCA GAAACTCTGG 900 

CAGGCCTTGG AGATTGCGCA AGCTAAGGAT TTTGTCAGTG AAAAGGAAGG ACTTTTGGAT 960 

GCCCTAATTG AAGCAGGGGG GCGAAATTTC TCAGGTGGAC AAAAACAAAG GTTGTCTATC 1020 

GCCCGAGCAG TCTTGCGCCA AGCTCCGTTT ATCATCCTAG ATGATGCAAC CTCGGCACTG 1080 

GATACCATTA CAGAGTCCAA GCTCTTGAAA GCTATTAGAG AAAATTTTCC AAACACGAGC 1140 

TTAATTTTGA TCTCTCAACG AACCTCAACT TTACAGATGG CGGACCAGAT TCTCCTCTTG 1200 

GAAAAAGGTG AGTTGCTAGC TGTTGGCAAG CACGATGACT TGATGAAATC CAGCCAAGTC 1260 

TATCGTGAAA TCAATGCATC CCAACATGGA AAGGAGGACT AGAATGAAAC GACAAACTGT 1320 

AAACCAGACG CTCAAACGTT TAGCCGTAGA TTTAGCAAAC CATCCCTTCC TCCTTTTCCT 1380 

AGCCTTTCTA GGAACTATTG CCCAAGTTGG CTTATCAATT TACCTACCTA TTCTGATTGG 1440 

GCAGGTCATT GACCAAGTCC TAGTGGCTGG TTCATCACCA GT 1482 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
{AJ LENGTH: 895 base pairs 
(B) TYPE: nucleic acid 
(C> STRAND EDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
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CTCTGAAGGA 


TCCAGAAAGT 


CGCTTATATA 


TTTTATTAAA 


AGACGGTCAG 


GTTATTGGGA 


60 


CTTGTACGGT 


TGATTTATCG 


ACTAATACGA 


ATTACTTCTA 


CGGTTTAGCA 


ATATTGGAAC 


120 


CTGAACGTGG 


AAAAGGCTAT 


GGAAGCTACT 


TAGCAAAATC 


CCTCGTCAAC 


CAACTAATTG 


180 


AGCAAAATGA 


CAAGGAATTT 


CAGATTGCAG 


TGGAAGATAG 


CAATGTAGGT 


GCCAAACGTT 


240 


TGTATGAAAA 


AATTGGCTTT 


GTCAAACAGA 


CTCAGGTGGT 


TTATCTGAAT 


GAGAAAGGAG 


300 


CAAGGGATTC 


CGAAGTGTAG 


AGATATTCGG 


ACTGAAATTT 


ACTTGAACTT 


TTAGTGATGA 


360 


AACTAATTGT 


TCTTGGATTT 


CAGCTTTCCT 


GATTATGATT 


TATGATTAAA 


ATCTATGACA 


420 


CCATGTCTCG 


TGATTTGCGA 


GAATTTGTCC 


CGATTGAGGA 


CGGCAAGATC 


AAGATGTATG 


480 


TTTGTGGGCC 


AACGGTGTAC 


AACTATATCC 


ACGTGGGAAA 


CGCCCGTTCG 


ACGGTAGCTT 


540 


TTTTGGATAC 


GAATTTCGTC 


GCTATTTTGA 


GTACCGTGGG 


TATAAGGTTG 


CCTATATTTC 


600 


CAATTTTACA 


GATGTGGATG 


ATAAGATTAT 


CAACCGTGCC 


AGGGAAGAAG 


GCATCACGCC 


660 


TCAGGAGGTT 


GCGGATAAGT 


ACATCGCTGC 


CTTTCGTGAG 


GATGTGACGG 


CCTTGGGCGT 


720 


GAAACCTGCG 


ACTCGCCATC 


CGCGTGTAGT 


GGAGTTTATG 


GCAGACATCA 


TCCGTTTTGT 


780 


GGAAGACTTG 


ATCGAGAAAG 


GCTTTGCCTA 


TGAGAGTCAA 


GGGGATGTCT 


ATTTCCGTGT 


840 


AGAAAAATCC 


CACAACTATG 


CTAAATTGGC 


TAATAAAACC 


TTGGAAGATT 


TGGAG 


895 



(2) INFORMATION FOR SEQ ID NO: 27: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



CTGTTTGCCC 


TTTCTTCTAA 


TAAAGAATTG 


GCAGAACGTG 


TGGCGCAGGA 


GATTGGGATA 


60 


GAGTTGGGGA 


AATCAAGTGT 


TCGCCAATTT 


TCAGATGGAG 


AGATTCAGGT 


CAACATTGAA 


120 


GAATCAATCC 


GTGGGAAACA 


CGTCTTTATC 


CTACAATCAA 


CTAGTTCGCC 


TGTAAATGAC 


180 


AATCTGCTTG 


AAATTTTGAT 


TATGGTAGAT 


GCTTTGAAGC 


GTGCGAGTGC 


AGAATCTGTC 


240 


AATGTTGTCA 


TGCCTTACTA 


TGGGTATGCA 


CGTCAGGATA 


GAAAGGCGAG 


AGCGCGTGAG 


300 


CCAATCACTT 


CAAAACTTGT 


CGCAAATATG 


CTTGAAGTAG 


CTGGAGTGGA 


TCGTTTATTG 


360 


ACCATCGACT 


TGCATGCTGC 


GCAAATTCAA 


GGATTCTTTG 


ATATTCCTGT 


GGATCATTTG 


420 


ATGGGTGCTC 


CTCTGATTGC 


AGATTATTTT 


GAGCGTCCTG 


GTATGGTTGG 


TTCTGACTAT 


480 


GTGGTTGTCA 


GCCCGGACCA 


TGGAGGGGTG 


ACTCGTGCTC 


GTAAGTTGGC 


AGAATTTTTG 


540 


AAAACATCTA 


TCGCTATTAT 


TGAGAAACGT 


CGTAGCGTTG 


ATAAGATGAA 


TACTAGTGAA 


600 


GTTATGAACA 


CCATCGGTAA 


GGTTGAAGGC 


AACCACTTGT 


AGCTTCGATT 


GATGATATGT 


660 


ATTGATACCG 


CTGGAACGAT 


TTGTCATGCG 


GCAGATGCTC 


TTGCGGAAG 




709 



(2) INFORMATION FOR SEQ ID NO: 28: 



(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 816 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



( ii ) MOLECULE TYPE: Genomic DNA 
{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



CTAGAAGAAC TCCGTCACAA 


GCTTGTGACT 


TGTAAGCAAG 


AACAGAGCAA 


GATTGAGGAT 


60 


TCCTTATGAT TTCATTCCTT 


CTTCTATTGG 


TCTTGGTTTG 


GGGATTTTAT 


ATCGGCTATC 


120 


GGAGAGGCCT GCTCTTACAG 


GTTTATTACC 


TGATTTCAGC 


CATGGCATCG 


GCTTTTATGG 


180 


CTGGCCAGTT TTATAAGGGG 


CTTGGAGAGC 


AATTCCATTT 


ATTGCTCCCT 


TATGCAAATT 


240 


CGCAGGAAGG TCAGGGGACT 


TTCTTTTTCC 


CATCGGATCA 


ACTCTTTCAG 


CTGGATAAGG 


300 


TCTTTTATGC AGGTATCGGC 


TACTTGCTTG TATTTGGGAT 


TGTCTATAGC 


ATTGGTCGTT 


360 


TGCTTGGTCT TCTCTTACAC 


TTGATTCCTA 


GCAAAAAACT 


GGGTGGTAAG 


TTGTTCCAAG 


420 


TTTCAGCAGG TATCTTGTCC 


ATGTTGGTGA 


CCTTATTTGT 


CTTGCAAATG 


GCCTTGACAA 


480 


TCTTGGCGAC CATCCCCATG 


GCAGTTATAC 


AAAATCCTCT 


TGAAAAGAGT 


ATCGTCGCAA 


54p 


AACACATCAT CCAGAGCATA 


CCGATAACAA 


CCAGTTGGCT 


CAAACAAATC 


TGGGTGACAA 


600 


ATTTAATCGG ATAAAAAGGG 


CAGGAGTTTT 


CCTAGCCCTT 


TGTTTACAGA 


TTTGACTCGA 


660 


ATCTATCAGA ATGTAAAAAG 


CTACCACACC 


TAGACATTCA 


AAGACAAGGA 


AATAAAGATG 


720 


AATAAGAAAA TATTAGAAAC 


ATTAGAGTTC 


GATAAGGTCA 


AGGCCTTGTT 


TGAGCCTCAT 


780 


TTGTTGACCG AGCAGGGCTT 


GGAGCAATTG 


AGACAG 
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(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1001 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CTTGAAGCAG CTGAAAGCAT GGGGTCTTCG GGATTTAAAA CCTATTCGTA ACGGGTTGTT 60 

TTACCTTTCC TAGTTCCAAC CTTACTAGCA GCTCCTTGCT TC3TATTTATG AGAGCATTCT 120 

CAGACTTTGG AACGCCTATG TTGATTGGCG AAGGATATCG GACTTTCCCT GTCCTGATTT 180 

ATACCCAATT TATTAGCGAG GTTGGAGGAA ATTCTGCTTT TGCAATTATG GCGATTATCA 240 

TTGCCTTGGC AATTTTCCTT ATCCAAAAAC AC ATTGC AAA CCGCTACAGT TTCAGCATGA 300 

ATCTGCTCCA TCCAATTGAG CCTAAAAAAA CTACAAAAGG AAAAATGGCT GCCATTTATG 360 

CAACAGTCTA CGGAATTATC TTTATCTCTG TTTTACCTCA AATCTACTTA ATTTATACCT 420 

CTTTCCTAAA AACATCAGGT ATGGTATTTG TTAAAGGTTA TTCTCCAAAC AGTTACAAGG 480 
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TAGCTTTCAA 


TCGTATGGGA 


TCTGCTATTT 


TCAATACCAT 


TCGTATCCCT 


TTGATTGCCT 


540 


TAGTTCTAGT 


TGTTCCTATT 


TACGACATTT 


ATCTCCTACC 


TAGCCGTTAG 


AAAACGGAAT 


600 


TTGTTTACAA 


ACTTAATTGA 


CAGCCTCAGT 


ATGGTACCTT 


ATATTGTACC 


AGGAACCGTT 


660 


CTAGGGATTG 


CCTTCATTTC 


TTCCTTCAAT 


ACTGGTCTAT 


TTGGAAGTGG 


ATTTCTTATG 


720 


ATTACAGGGA 


CTGCTTTCAT 


CTTGATTATG 


TCTCTATCTG 


TCAGAAGATT 


ACCGTATACT 


780 


ATTCGCTCAT 


CTGTTGCTAG 


CTTACAACAA 


ATAGCACCAA 


GTATTGAAGA 


AGCTGCTGGA 


840 


AAGCTTAGGA 


AGTAGTCGTC 


TCAATACCTT 


TGCTAAGATT 


ACAACTCCAA 


TGATGCTATC 


900 


TGGTATCATT 


TCTGGAGCCA 


TCTTATCTTG 


GGTCACAATG 


ATTTCAAAAC 


TCTCTACTTC 


960 


TATCCTCCTC 


TACAATGTCA 


AAACAAGAAC 


AATGACTGTA 


G 




1001 



(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1064 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 



CTGGTTTGCA 


ATCTATTCCA 


ACTGAGATTA 


CAGAGGCAGC 


AAGGATTGAT 


GGTGCGACTA 


60 


GCAAGCAAGT 


TTTCTGGAAC 


ATTGAATTGC 


CTTACTTGCT 


ACCAAGTGTC 


TCTATGGTCT 


120 


TTATCCTAGC 


CCTAAAAGGT 


GGGCTGACTG 


CCTTTGACCA 


AGTCTTTGCC 


ATGACCGGTG 


180 


GTGGTCCAAA 


CAATGCCACA 


ACCTCACTTG 


GGCTCTTGGT 


TTATAACTAT 


GCCTTTAAAA 


240 


ACAACCAATT 


CGGTTATGCC 


AATGCCATTG 


CCGTAATCTT 


GTTCCTCTTA 


ATTGTAGTGA 


300 


TTTCGATCAT 


CCAATTGAGA 


GTATCTAAGA 


AATTTGAAAT 


TTAAGAGGAG 


AAGCATGATG 


360 


AAACAAGATG 


AAAGAAAAGC 


CCTGATTGGC 


AAATACATTC 


TATTGATTCT 


AGGATCGGTT 


420 


CTGATTTTAG 


TGCCGCTCCT 


TGCTACCCTC 


TTTAGTTCCT 


TTAAACCCAC 


TAAGGATATT 


480 


GTAGATAATT 


TCTTTGGCTT 


TCCAACCAAC 


TTCACATGGG 


ACAACTTTAG 


CCGTCTCTTA 


540 


GCTGATGGGA 


TTGGAGGCTA 


TTATTGGACT 


CTGTCGTCAT 


CACTGTCTTG 


TCTTTACTTG 


600 


CAGTAATGAT 


CTTTATCCCT 


ATGGCAGCCT 


ACTCCATCGC 


TCGCAATATG 


AGTAAAAGAA 


660 


AAGCCTTTAC 


CA'TTCATGTA 


TACCCTCTTA 


ATCCTCGGAA 


TCTTCGTACC 


TTTCCAAGTC 


720 


ATCATGATTC 


CGATTACGGT 


TATGATGAGT 


AAACTCGGTT 


TGGCTAATAC 


CTTTGGTTTG 


780 


ATCTTGCTCT 


ACTTGACCTA 


TGCGATTCCA 


CAGACCCTCT 


TTCTCTATGT 


TGGATATATC 


840 


AAAATCTCGA 


TTCCAGAAAG 


TCTGGAT3AA 


GCAGCAGAGA 


TCGATGGGGC 


TAATCAATTT 


900 


ACAACCTATT 


TCCGCATCAT 


NTTCCCAATG 


ATGAAACCGA 


TGCATGCGAC 


AACCATGATC 


960 


ATCAATGCCC 


TTTGGTTCTG 


GAATGACTTC 


ATGTTGCCAC 


TCCTTGTCTT 


GAACCGGGAT 


1020 


TCCAAAATGT 


GGACTCTGCC 


TTTGTTCCAA 


TACAACTACG 


CAGG 




1064 



(2) INFORMATION FOR SE* ID NO: 31: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CTGTAAAAGA AGTTAATCAA GTTGATAAAG CACTTTTAAA ACAAATTGGT GCAGTTGATG 
TCTTAGAAGT GAAGGGTGGC ATTCAAGCAA TCTATGGAGC AAAAGCAATC TTATATAAAA 
ATAGTATTAA TGAAATTTTA GGTGTAGATG ATTAAGTACT TACTGACTTA ATAAAAAACA 
GAGGAGAGTG ATGGATGAGT AGGATGAAAT GAAATCGCAT ACAAGAAATA AAGAACTCAT 
TATCCAAGTT GGATACGCTT ATTACATAGG AGAATACAAA TGAAATTTAG AAAATTAGCT 
TGTACAGTAC TTGCGGGTGC TGCGGTTCTT GGTCTTGCTG CTTGTGGCAA TTCTGGCGGA 
AGTAAAGATG CTGCCAAATC AGGTGGTGAC GGTGCCAAAA CAGAAATCAC TTGGTGGGCA 
TTCCCAGTAT TTACCCAAGA AAAAACTGGT GACGGTGTTG GAACTTATGA AAAATCAATC 
ATCCAAGCGT TTTGAAAAAG CAACCCAGAT ATAAAAGTGA AATTGGAAAC CATCCACTTC 
CAGTCCAGTC CTGAAAAATC ACAACAGCCA TCCGAAGCAG GAACAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 

586 



(2) INFORMATION FOR SEQ ID NO: 32: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

TGGTAGTTCT TCCTAGTGCA GGGGCTGTAG ACCCAGTTGC GACCCTAGCG CTGGACTAGT 
CGAGAGGGTG TTGTTGAAAA TGGATGGNTA TCGCTATGTT GGTTATCTAT CAGGTGACAT 
CCTCAAAACG CTTGGCTTGG ACACTGTTTT AGAAGAAACC TCAGCAAAAC CTGGAGAGGT 
GACTGTAGTC GAAGTTGAGA CTCCTCAATC AACAACAAAT CAGGAGCAAG CTAGGACAGA 
AAACCAAGTA GTAGAGACAG AGGAAGCTCC AAAAGAAGAA GCACCTAAAA CAGAAGAAAG 
TCCAAAGGAA GAACCAAAAT CGGAGGTAAA ACCTACTGAC GACACCCTTC CTAAAGTAGA 
AGAGGGGAAA GAAGATTCAG CAGAACCATC TCCAGTTGAA GAAGTAGGTG GAGAAGTTGA 
GTCAAAACCA GAGGAAAAAG TAGCAGTT^A GCCAGAAAGT CAACCATCAG ACAAACCAGC 
TGAGGAATCA AAAGTTGAAC CACCAGTAGA ACAAGCAAAA GTCCCAGAAC AACCCGTGCA 
ACCTACACAA GCTGAGCAAC CAAGTACACC AAAAGAATCA TCACAACAAG AAAATCCTAA 
AGAAGATAGG GGAGCGGAAG AGACACCGAA ACAAGAAGAT GAACAGCCAG CAGAAGCCCC 
AAGAAATCAA GGTTGAAGAA CCAGTAGAAT CAAAAGAGGA GACTGTTAAT CAACCTGTTG 

74 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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AACAACCAAA 


AGTGGAAACG 


CCTGCTGTAG 


AAAAACAAAC 


GGAACCAACA 


GAGGAACCAA 


780 


AAGTTGAAGT 


AACAAGTATT 


CCCCAAACTA 


CTCGCTATGA 


GGAAGACCTT 


ACTAAGGAAC 


840 


ACGGAACGCG 


TGAAGTTGTT 


AAGGAAGGTA 


AGAATGGCAG 


TAGAACAGTT 


ACTACTCCAT 


900 


ATATCTTGAA 


TGCGACAGAT 


GGTACGACTA 


CAGAAGGCAC 


TTCGACAACT 


GATGAAGCTG 


960 


AGATGGAGAA 


AGAGGTTGTT 


CGTGTTGGCA 


CGAAACCCAA 


AGAAAAATTA 


GCTCCAGTCT 


1020 


TAAGTTTGAC 


AAGTGTTACA 


GATAATGCAA 


TGTTGCGTAG 


TGCGAGACTT 


ACTTATCATT 


1080 


TGGAAAATAC 


AGATAGTGTT 


GATGTGAAAA 


AAATTCATGC 


TGAAATTAAA 


AATGGCGATA 


1140 


AGGTTGTCAA 


AACTATTGAC 


TTATCTAAAG 


AGAGATTATC 


AGATGCTGTT 


GACGGTCTTG 


1200 


AACTTTATAA 


AGATTATAAG 


ATTGTGACGA 


GTATGACCTA 


TGATAGAGGT 


AATGGTGAAG 


1260 


AAACCTCTAC 


GTTGGAAGAA 


ACTCCACTAC 


GATTAGACCT 


CAAGAAGGTT 


GAATTGAAAA 


1320 


ACATCGGCTC 


TACTAATCTC 


GTCAAAGTAA 


ATGAGGATGG 


TACTGAGGTG 


GCAAGTGACT 


1380 


TCTTAACAAG 


TAAACCTGTG 


GATGTGCAGA 


ATTACTACCT 


CAAAGTAACT 


TCCCGTGATA 


1440 


ATAAAGTTGT 


TTCCCCTCCC 


AGTTGAAAAA 


ATTGAAGAGG 


TGACTGAGGA 


AGGTCCACCA 


1500 


CTTTACAAAG 


TCCCTGCTAA 


GGCCCTAATT 


TGAT 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



CGGATTTGTT 


TATCTCAAGA 


AGAGTCGCGA 


TATTCTCCGT 


GAAAGTTCAG 


AATTGATTAN 


60 


CCAAACGGTA 


GAAGATTATC 


TTCAAGGAGA 


TGACTTTGAC 


TGGGCAGATC 


TTAAAGGGAA 


120 


GGTTCGANAT 


AATTTGACCA 


AGTATCTCTT 


TGACCAAACC 


AAGCGTCGTC 


CANCTATTTT 


180 


ACCAGTAGTC 


ATGGAAGCAA 


AATAATCGTT 


GAAATAAACA 


GAGAGAAAGT 


CGAGTTTCGG 


240 


CTTTTTCTTA 


TAGAAAAATA 


GAAGGAGAAA 


ATCATGGCAG 


TAATGAAAAT 


CGAGTATTAC 


300 


TCACAAGTAT 


TGGATATGGA 


GTGGGGGGTG 


AATGTCCTCT 


ACCCTGATGC 


CAATCGAGTG 


360 


GAAGAACCAG 


AGTGTGAAGA 


TATTCCCGTC 


TTGTACCTTT 


TGCACGGGAT 


GTCTGGAAAT 


420 


CATAATAGTT 


GGCT7AAGCG 


GACCAATGTA 


GAACGCTTGC 


TTCGAGGAAC 


TAATCTCATC 


480 


GTTGTTATGC 


CCAATACCAG 


CAATGGTTGG 


TACACCGACA 


CCCAGTATGG 


TTTTGACTAC 


540 


TACACGGCTC 


TAGCAGAGGA 


ATTGCCACAG 


GTTCCTGAAA 


CGCTTCTTCC 


CTAATATGAC 


600 


TAGCAAGCGT 


GAAAAGACCT 


TTATCGCTGG 


TCTTTCTATG 


GGAGGCTACG 


GCTGCTTCAA 


660 


ACTGGCTCTT 


ACGACAAATC 


GTTTTTCTCA 


TGCAG 






695 



(2) INFORMATION FOR SEQ ID NO: 34: 

fi) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1096 base pairs 



75 



WO 97/37026 PCT/US97/05306 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

CTCATCCAAG AATAGCATAA AAGAACTCTT CTGTTTTTTC TTGAGGAACG AGAAAGTCAG 60 

CAATCTCCAT TCCACGATCC ATCAAATCCT CTAAAGACAT CGTGATTTTT AAAGTAGTAT 120 

CACTAATTTG TTTCATTTTC ATTGCTAGTA ACCTCATACT TTCAGTTCTA TCTATTATAC 180 

TAGATTTTTA CGATTTTATC AAAAGAAGGC TCCTCTATAC GGATAGATTT TCCCTAGGGT 240 

CTTTCTATAG GAGACTCCAA GAGAAAATTT CTGCAGACAG ATAGAAAAAG CCTTCAAAAT 300 

CGGCTCTTAG CCGATTTTGA AGACCTTATA CATCAGAATA CTTATAATTT AAAGGTTGCT 360 

ACACCGAGGA TAGAACGATT TAAGTTTCTG AGAATTTGAA GACTTTGCTC AAATTTCTTA 420 

TAACGAATTC ACTCCGTACT CTTCAACAAG AAGGACTGTA TCTCTTTCCA AAAGAGATGA 480 

TACATCCTGC AAATCTACAA ATGCATTCCT TTTAAAGCTT CTTGACTCTG TTTCAATTTA 540 

TCTAGGATAG CTTTATTTGA GCTAACGATG GTCAATTCCT GTCCAGTATT TTTGTATGAC 600 

AAAACATCTG CTAGGTTAGC AATTGTTGTA ATCTCTGTTA CAAAATCAAT TTGATACTGA 660 

GAAAAATCAC CTACTCTATT GATTGTTGGA TTAAAGAGAT AAACTAACAC ATTTCCCATC 720 

ACAACCAAAA TCACACAAAC CACTCCAATA ACAACTAAAC GAAGAATCAG ATTTTTCACA 780 

TTTAAGCCAA GCGCTGTTTC ACCATTTGCG TTCAATTCTT TAGAGTTGAT GGTTTCCAGT 840 

TTTTCAATTT TCACATTTGC ATAAGCATGT TTAAATTTCT CAATCAACCC ATCAATTTTT 900 

TTCTCTAACA AGTCATTGGC ATCTTTACTT GATGTCAAAA TTTTCACACC AACCCCTGCA 960 

TCGTCAATCA TATAGTAGAC GGTCAATTTT TTCCACCAAT AGTCATTCGT TGAATTTTTC 1020 

AAGGTTGTTT CCGTCGTGTC TAATTCACTG GCAATTTTTT TCAACTCACT GGGTTCTACA 1080 
TCATTGAAAA GATAAG 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1037 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



TCCCCATCCC AGTGGGCAGC AGCATTAGTC TGTACTTCTG CGTCGTTGCG TTTTATTAGA 
AGACTGATTT TGCTTTGAAC TACTTGAAGC TGTCGTTGTA TCCTGCGAAG TTTTTGCTAA 
CTTCAGTTTC ATTTGCAGCG ACATGACTAG TTCCCAAGCC TAGTAAACAG ATACTTGCTA 
ATCCAATTTT TGTTTTCAAT CTTTCCTCTC CTATAAAAAA TGTAACAGAC ATCTGAATGC 



1096 



60 
120 
180 
240 
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TGTTCCACCT AGCTTTTGCT ACTTACTGAT 
CTTTCAGCTT CAAGTGACGC ACCACCTACA 
TATGAAGCAA CATTTTCAGG TTTAACAGAA 
ACTTCTTGAC CAAAGTCAGC AGCTACAACG 
TCGTCTTGTG AAGCTGATTT ACCAGTACCG 
GAGGCAGCAA CTTGTTCAGC AGTCAATCCA 
AATTCAGCAG CTTTACCAGC TTCGTAAGTT 
AGCATACCGT TCGCAAAGAT TGCTTTTGCT 
AAGTAGTCAC GGCGTTTCTG AGTGACCGAT 
AACTTGTGGG CTAGTTTCAC CAGTGAAAGC 
AGCGACTTTT AAGTTTGAGC CTTNGGCAAC 
AGCAGNGATA CCTGCTTCAA CAAGATCTGA 
GAATGNTCCA GCTTNTTGGG GATTTTTGTT 
ACGTGACATT TCACATA 

(2) INFORMATION FOR SEQ 



PCT/US97/05306 

TATTTTACAA AGTCAAGCAA AGCCAAGAAG 300 

AGGGCACCGT CAACGTCTGG GCAAGCCATG 360 

CCACCGTATT GAACACGAAC TTTGTCTGCG 420 

TCACGAACAA CTTTACACAT TTTTTGTGCA 480 

ATAGCCCAGA TTGGCTCATA AGCGATAACT 540 

GCCAATGCAG CAGATACTTG AGCACCTACG 600 

TCAAGTGATT CACCACAACA GATGATTGGA 660 

TTTTTGTTGA TATCTTCGTC AGTTTCATGG 720 

AACAACGTAG TCAGTACCGA TTTCTTTCAA 780 

ACCTGCATTT TCTCCGTAGC AGTTTTGAGC 840 

AGCAAGNACA GTTGTCAAAT CAAGAGCTGG 900 

TGAAGGAAGT TTTGATGCAA CTGCTTCAAC 960 

CATTTTCCAG TTACCAGCGA TAAATGGTTT 1020 

1037 

ID NO: 36: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



CTTATCGTAC 


TAAGGATGGC 


AGTGTTCAAC 


TGTTCCGTCC 


TGATGAAAAT 


GCTAAACGCC 


60 


TGCAACGTAC 


ATGTGACCGT 


CTCTTGATGC 


CAACAAGTTC 


CGAACAGACA 


TGTTTGTAGA 


120 


AGCTTGTAAA 


GCAGTTGTCC 


GTGCGAATGA 


AGAATACGTA 


CCACCATACG 


GAATAGGTGG 


180 


AACTTTATAT 


CTTCGCCCTC 


TTTTGATTGG 


TGTCGGAGAT 


ATTATCGGGG 


TAAAACCGGC 


240 


AGAAGAGTAC 


ATTTTCACCA 


TCTTTGCTAT 


GCCAGTTGGA 


AATTACTTTA 


AAGGTGGTTT 


300 


GGTCCCAACC 


AACTTCTTGA 


TTCAGGATGA 


GTACGACCGT 


GCAGCACCAA 


ATGGTACAGG 


360 


TGCGGCTAAG 


GTTGGTGGAA 


ACTATGCTGC 


AAGTCTCTTA 


CCAGGAAAAA 


TGGCCAAGTC 


420 


ACGCCATTTC 


TCAGATGTTA 


TCTATCTGGA 


CCCATCAACT 


CATACAAAGA 


TTGAAGAAGT 


480 


CGGATCAGCT 


AATTTCTTTG 


GAATTACAGC 


TGATAATGAA 


TTTGTAACAC 


CATTGAGTCC 


540 


ATCTATCTTG 


CCATCTATTA 


CCAAGTATTC 


CTTGCTTTAT 


TTGGCAGAAC 


ATCGCTTGGG 


600 


ATTAACTCCT 


ATTGAGGGTG 


ATGTTCCAAT 


TGATAATCTT 


GACCGTTTTG 


TAAAGGCAGG 


660 


TGCCTGTGGT 


ACAGCAGCGG 


TTATTTCTCC 


AATTGGAGGT 


ATTCAACATG 


GTGATGATTT 


720 


CCATGTTATT 


CTATAGTGAA 


ACAGAAGTAG 


GTCCTGTGAC 


ACGTAAATTA 


TATAATGAAT 


780 


TGACGGGTAT 


TCAGTTTGGC 


GATATTGAAG 


CGCCAGAAGG 


TTGGATTGTA 


AAAGTAGATT 


840 


AAAATAAACC 


AAAGGAGATT 


TTTTATGAAA 


TAGAAAAAGT 


GGCTCTTAAC 


AGCAGGAGTG 


900 


GTCCTGAGCA 


CGTCAGCTAT 


TTTAGTGGCT 


TGTGGAAAAA 


CTGATAAAGA 


AGCAGATGCA 


960 


CCGACAACAT 


TTTCTTATGT 


CTATGCAGTA 


GATCCAGCAT 


CATTGGGCTA 


CAGTATAGCG 


1020 
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ACTCGAACAT CGAGGACAGA CGTTATTGGA AATGTTATTG ATGGTTTGAT GGAAAATGAT 1080 

AAATACGGCA ATGTTGCTCC TTCTCAAAAA GACTATGATT TGAACAGTAC AGGATGGGCT 1140 

CCAAGCTATC AAGATCCAGC GTCTTACTTG AATATTATGG ATCCAAAATC TGGTTCTGCC 1200 

ATGAAACACC TTGGCATTAC GAAAGGAAAA GATAAGGATG TTTGTAACTA AACCTGGTTT 1260 

GGATAAATAT AAGAAATTGT TAGAAGATGC TGTTTCTGAG ACCACTGACC TAGAGAAGAG 1320 

ATATGAAAAA TATGCCAAAG 1340 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 37: 

CTCTGACGGC GTCGCCACTT AAGAAGAGTA TCAAAAAGAA AAATAGAAAA TTAACTAACA 60 

AGGNAGAAAA ACACATGTCT AAAATTATCG GTATTGACTT AGGTACAACA AACTCAGCAG 120 

TTGCAGTTCT TGAAGGAACT GAAAGCAAAA TCATCGCAAA CCCAGAAGGA AACCGCACAA 180 

CTCCATCTGT AGTCTCATTC AAAAACGGAG AAATTATCGT TGGTGATGCT GCAAAACGTC 240 

AAGCAGTCAC AAACCCAGAT ACAGTTATCT CTATCAAATC TAAGATGGGA ACTTCTGAAA 300 

AAGTTTCTGC AAATGGAAAA GAATACACTC CACAAGAAAT CTCAGCTATG ATCCTTCAAT 360 

ACTTGAAAGG CTACGCTGAA GACTACCTTG GTGAGAAAGT AACCAAAGCT GTTATCACAG 420 
TTCCGGCTTA CTTCAACGAC GCTCAACGTC AAGCAACAAA AGACGCTGGT AAAATTGCTG 
GTCTTGAAGT ANGACGTATT GTTAACGAAC CAACTGCAGC AGCTCTTGCT TATGGTTTGG 
ACAAGACTGA CAAAGAAGAA AAAATCTTGG TATTTGACCT TGGTGGTGGT ACATTCGACG 

TCTCTATCCT TGAATTGGGT GACGGTGTCT TCGACGTATT GTCAACTGCA GGGGACAACA 660 

AACTTGGTGG TGACGACTTT GACCAAAAAA TCATTGACCA CTTGGTAGCA GAATTCAAGA 72.0 

AAGAAAACGG TATTCGACTT GTCTACTGAC AAGATGGCAA TGCAACCTTT GAAAGATGCG 780 

GCTGAAAAAG CCAAGAAAGA CCTTTCTGGT GTAACTTCAA CACAAATCAG 830 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 667 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 



480 
540 
600 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

GTGGTGGTTT GACTGCCCTT ATCGAAGCAG GTTTCGAANT CTAAGACANA AGCAGGTTAC 
GCTCCAGAAT TGGCTTAACT TTGAAGTTCT TCACGGAAAT GAAATTGATC GTTGACTTGA 
TCTACGAAGG TGGATTCAAG AAAATGCGTC AATCTATTTC AAACACTGCT GAATACGGTG 
ACTATGTATC AGGTCCACGT GTAATCACTG AACAAGTTAA AGAAAATATG AAGGCTGTCT 
TGGCAGACAT CCAAAATGGT AAATTTGCAA ATGACTTTGT AAATGACTAT AAAGCTGGAC 
GTCCAAAATT GACTGCTTAC CGTGAACAAG CAGCTAACCT TGAAATTGAA AAAGTTGGTG 
CAGAATTGCG TAAAGCAATG CCATTCGTTG GTAAAAACGA CGATGATGCA TTCAAAATCT 
ATAACTAATT AGAAATATAT AGCGCTGGAG ATGATTTTAT GAAAAAGATT ATGAGAAAAA 
TTGCATCGTT ATTATTGGTT CTAGTTGTAT AATGTAATTA CACCGTCGGT AATAGTGCTA 
GCAGACCAAA ATAAAGCAGA TTGGTCGTAT GATGAAAATA CTGTAATTAA CATTTATGAT 
GATGCTAATT TTGAAGATGG TAGGTTGCAT ATGACCTTTG AACAATTCTT CAAATTGGCA 
CAAATAG 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1397 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



CGGTTCCAGA 


NACTGGTATG 


GCTAGGAGTC 


GCGACAAAGT 


GTATCTTATG 


ATGGTCGATT 


60 


TGATACTCAA 


TACGCTCAAT 


ACTTACGGAA 


AGAATAAACG 


ATCATTGTCA 


ACCGGAATAG 


120 


TATGATAATC 


GTTTCTTGGT 


GGAATAAGTT 


CAATCTTATC 


CTTCTTAGGC 


ACAATAATGG 


180 


AAGAGCCCAG 


TGTTCGATAA 


ACACGATTAT 


TAAGGCTGGC 


AATTTCCGTT 


AATTGCAAAG 


240 


CTTCAATGGT 


AGGGTGTAAA 


ACAGCACCAC 


CAAGAGACTA 


GTTATAGGCA 


GTACTACCAG 


300 


TCGGTGTCGA 


AACTGTTAGC 


CCGTCTCCAC 


GAAAACGTTC 


AAAGGGAACA 


CCATTTATTA 


360 


CAATATCTGC 


CACCATGGTT 


CGATCAGACC 


TGCGGATGCT 


GGCTTCGTTG 


AGTGCTCTGA 


420 


AAATCTTAAC 


TTCACCATTT 


TCAAGAAAGA 


CCTTCACATT 


CAGAACAGGG 


TAAGAAACCC 


480 


TTGCCCCAGT 


ATCTAGCTGC 


AAATTAGTCA 


CTAGCTTGTC 


CAACTCAAAA 


TCACGATAAT 


540 


CTGTATAGAA 


GCCCAAATGT 


CCAGTATGAA 


GACCGATAAA 


GCGGACCTTG 


TCAAGCTGAT 


600 


TTTCGTACTT 


ATGAAAGGCC 


GACAAGAGCA 


TACCATCCCC 


GCCAATGGAA 


ATGACAATAT 


660 


CCGGATTGGT 


ATCATTGAGT 


TATAAACTGA 


TTTCTCTTCA 


AACGATCTCG 


CAATTCATAC 


720 


AAAACCCTTT 


GACTCTGCGG 


TTTTCTATTG 


GCTATCAGAT 


CAATTCGTTT 


ACCTGTATTC 


780 


TTCATCTGTA 


TCGTCACTGN 


NTCCAACACC 


GTCATTTAAT 


TTTCTACTCA 


AAGGATCAAA 


840 


AAGTGCCTGG 


GCTTCTTGGA 


TATCATCACG 


AATTTCACCC 


ATTTCTTCAT 


CCAACTGATG 


900 


GGCGATTCTA 


GCTGTAATTT 


CCAGTCGCTT 


CTTAATCTCA 


TCTGGGAAAT 


CCCCTTGGTA 


960 


CTTGTAGTTG 


AGAGAATGTT 


CTATCGTTGC 


CCAGAAATTC 


ATGGCCAAAG 


TACGAATTTG 


1020 
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AATTTCTGCC AAAATAGTCT TAGCTCCATT 
ATGATAGGAA CGATAGCCTG ATGCTTTTCT 
TCGCATATCC TGACGCTTGT GCAAAATATC 
AACCATCACA CGTAAGCCAG CAATATCCTG 
GCCACGACGA GCCATTTTTT CTTTGATGCT 
CTCAATTGGA GAATGCTTAT TTTGCTTACG 
CTTTAACTCA CCAACAG 
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GATGGTATCA ACCGTATATT CTACTACCAC 1080 

ATGAGTAATG TAATCTCGCT CCTGTATGAT 1140 

CACTACTTCC TTGACGTCAT CTACAAACTG 1200 

CAAATCGTGT TCCAAGGTCG CATAAGTAAT 1260 

CTCAATTGGC TTGACTCGAC CGGTCACAAA 1320 

ATATTGCTTA CGAATACCAC GAAGTTTAAT 1380 

1397 



(2) INFORMATION FOR SEQ ID NO: 40: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CTCCCATGAC ACCTGCACGC TCCGTACGGG CCACGTGCTT TTTAATCCGT TCAACAACTT 
CATAACCCGC TTCAACATCC ACACCCGACT GGGCATATGC ATTTTTATTT GTCATTTGTT 
TTTTCCTTTT CTTTAATGGA GAATCTGTCG CCTACTTGTA AAAACTGGTC TTTTCTTCCA 
AACTTCTACG ATAGTCTTCT TCGTAGTCAT AGAGAGGCGT TGGGTAGTCA CCGTCAAAGT 
AAGCGACACA GAGACCACCG TTCGGCGCAT CTGTTTCGAT ACCAATCGAC TCAATCAAGC 
TATCAATTGA AAGATAAGTC AGACTGTCCG CACCAATGAT TTGGCGAGTT TCTTCGACCG 
TATGATTGGC TGCAATCAGC TCCTGACGGG TCTGGATATC AATCCCGTAG AAACATGGAT 
ACGCTAGTGC AGGACTTCCA ATGGCAACGT GAACCTCAGT CGCACCCGCT TCTTTCAAGA 
GCTGAACGAT ACGACGAGAG GTTGTTCCAC GTACAATGGA ATCATCCACC ATGACCACAC 
GTTTGCCTTT GACAACACCC GAAACAGCAG ACAGTTTCAT CCGCACTCCT TGCTCCCGCA 
ATTCTTGAGT CGGTTGGATA AAAGTTCGCT GGGTGTATTG GTTTTTGATC AGACCCATTT 
CATTTGGTAA GCCTGATTCT TCCGCAAATC CCATAGCCGC GCTTAGGGAA GAATTGGGCA 
CACCAACTAC AATATCTGCC TCATGCTTAA ATTCTCGCGC CAATTGCGCT CCCATTCTCT 
TACGTGCCGT ATGGACATTG ACCCGTGGAT ATTAGAATCA GGGC 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
824 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



CTTCTTCCTT 


GGTACTTCCT 


CAATTGGATT 


ACCTTTGGCA 


TTGCTAGTTT 


TTCACTCCTA 


60 


CCTTATATCC 


AAGGCACTAA 


AATTATGTTT 


TACCGAGCAG 


TATTGGCTCG 


AAAACGTCCA 


120 


AAAGCTTGAA 


GGTTTTCCCC 


TCAAGCTTTT 


TTCTATCAAT 


GAATTTCTCC 


GCCTACCAAC 


180 


GTGAGTTCCT 


GATCTCCATG 


TTCTATCATA 


GCGCCGATTG 


CTGCTTCTAT 


CCCTCGCCGA 


240 


ATATCCACTA 


AACTCATAGT 


TGGAGTAGTC 


GGTCTGTTCA 


CCACCTGTTC 


CATCATATAA 


300 


GGAATATGCA 


TAAAACCTGC 


CTTAACATAT 


GGGAATTTCT 


TTTCTACCAA 


ATAGAGAGCC 


360 


TGATACATCA 


AATGACTGCA 


AACAAAAGTC 


CCTGCACGTA 


TTGGAAACAG 


ACTGCCGGTA 


420 


AGTCCTTCTT 


1TTTATTAGC 


TTGTACCATC 


GCTTTGATAG 


GTAAACTACT 


AAAATAGGCC 


4B0 


GATGCTCCAT 


CAATACGAAT 


CGGTGTATCA 


ATTGGTTGAT 


TGCCTTCGTT 


ATCAGGTATG 


540 


CGAGCATCAT 


CTTGATTAAT 


AGCCACTCGT 


TCAGGTGTTA 


AGCCGGTCCT 


GCCGCCTGCT 


600 


TGTCCAATAC 


AAAGTACAGC 


ATCTGGTTGA 


TATCGTAATA 


TTTCTGCCTC 


TAAAACTTCT 


660 


GACGACTTAT 


AAAAAACCGT 


TGGAATTTCT 


ACCCAGCGAA 


CTTCAGCCCC 


ATTAATCTCA 


720 


GATGGTAATA 


ATTTTACAGC 


CTCCAAAGCT 


GGATTAATCT 


TTTCACCTCC 


AAAAGGATTA 


' 780 


AAACCTGTAA 


CCAATATTTT 


CATTTTATTT 


TCCTTTACTA 


AAATGCGAGA 


AAGTACATTA 


840 


AGAATATGTG 


AATAACAATC 


ATTACTAGAG 


CAACACCTGC 


TTGAGCCTTT 


ATAACGCCAT 


900 


TCTGATCTTT 


CATATCCATC 


AATGCTGCTG 


GTAGAGCGTT 


AAAATTAGCA 


GCCATTGGGG 


960 


TCAATAAGGT 


CCCACAATAA 


CCTGCTGTCA 


TGGCAAGAGC 


ACCAGCCACA 


ATTGGATTAG 


1020 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



CTCGTAAAAA 


CACTAAAGGA 


GGTAAATAAC 


CAATGTTAGT 


ACCTAAACGT 


GTTAAACACC 


60 


GTCGTGAATT 


CCGTGGAAAA 


ATGCGCGGTG 


AAGCAAAAGG 


TGGAAAAGAA 


GTAGCATTCG 


120 


GTGAATACGG 


TCTTCAAGCT 


ACAACTAGCC 


ACTGGATCAC 


TAACCGCCAA 


ATCGAAGCTG 


180 


CTCGTATCGC 


CATGACTCGT 


TACATGAAAC 


GTGGTGGTAA 


AGTTTGGATT 


AAAATCTTCC 


240 


CACACAAATC 


ATACACTGCT 


AAAGCTATCG 


GTGTGCGTAT 


GGGATCTGGT 


AAAGGGGCAC 


300 


CTGAAGGTTG 


GGTAGCACCA 


GTTAAACGTG 


GTAAAGTGAT 


GTTCGAAATC 


GCTGGTGTAT 


360 


CTGAAGAGAT 


TGCACGTGAA 


GCGCTTCGAC 


TTGCTAGCCA 


CAAATTGCCA 


GTTTAAATGT 


420 


AAATTCGTAA 


AACGTGAAGC 


AGAATAAGGA 


GAAGGCATGA 


AACTTAATGA 


AGTAAAAGAA 


480 


TTTGTTAAAG 


AACTTCGTGG 


TCTTTCTCAA 


GAAGAACTCG 


CGAAGCGCGA 


AAACGAATTG 


540 


AAAAAAGAAT 


TGTTTGAACT 


TCGTTTCCAA 


CCTGCTACTG 


GTCCAATTGG 


AAACAAACAG 


600 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



60 
120 
180 
240 
300 



CTTCAGCGAC ATATCTATCC ACAACTTCTC TCGATTCATG TTCCTCTGAA AATGCCTGAA 
ATTTTAATTG ACTAATTTGA TCCTGATACG AACTATCTGC TAACAAAACT TCAAGATGGG 
AAACATTTGC TAACGGATAA GGTCTTCTAT CCTTACCTAA CCAAGTTTCT GTCTCTTCAT 
CCTCTATTAG TCCCCAGTTA CTGGCAAAGT CAGGATGATT CTCTAAAAAA ATACGTTCTG 
TCTGAAAAGT GACTGACCGA ATGGGGAAAG AAGCTGTTCC TCTCTCAAAA CTAGTAAACA 
ATGCACGCGC AATCCCCTGA CGGCGATGAC CTGGATGAAC CAGTATCGTC ACTTCTACAT 360 
CTTGGTCATC TGCATAGACA GTTAATAAAC CAACAAGTTC GCCTTTTTCA TAATAAAGGA 420 
AAAAGGCGGG CATGTTTGGA TCAAAATTAA GCATGTTAGA GAGATAGGGA TCGCGATAGG 480 
TACCGTCATA GTTTTGGCAA CAGTTAATTA CTTTTTTCGC CTCAGATAGC TCTTCTTGGC 540 
TTAACTTGTT TCTTGCTTGA ATCATATAGG TATCCTCTAC AAACCAGACG ATCTGTGACT 600 
GGCATCTTTA GCCTGCTCGA GTTTATTGAC ATAATACTCC CGTTTTTCTT CGACTTCGTG 
AATAACAGGC TCATCTTTCT TACCATGAAG ACGGACAATC TTGGCCGGAA TACCGACAAC 
CGTCACATCA CTAGGTACAT CTGCTACGAC AACTGCTGCA GCACCGACCT TGGCATTTTC 
ACCAATTTCC ACAGGCCCGA TAACTTGGGC ATGGGCTGAT ATGAGGGCTC CCTTTCGTAC 
AGTCGGATGG CGTTTGCCAC AGTCTTTCCC TGTTCCCCCG AGAGTCACTC CGTGATAGAG 
AAGAACGCCT TTTTCAACAA TCGCTGTCTC TCCAATCACC AGACCAGAAC CATGGTCAAT 
AAAGACACCT GAATCAATCT GGGCTCCTGG ATGAATCTCA ATCTGAGTCC AAAAGCGCCA 
AAACTGACTG TGCATACGAG CTAAGAGTTT GAAGTCGTAC TTCCAGAGAA AATGCGAGAG 
ACGGTGGGCC GCCAAGGCCT TGACACCTGG ATAAGTCAGC AAAACCTCCA AAGTGGTGCG 
GGCCGCTGGA TCATTTTCTT TTACGATATC AATGGTTTCG CGCCACCACC CCATACATTT 
CTCCTTTTCT TACTCTGAAT CTT 

(2) INFORMATION FOR SEQ ID NO: 44: 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1223 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND5DNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



GGTGAGTGCC 


TTTATTAGCG 


GAGCTTCTCA 


AGTGATTGTT 


GCCTTGATTA 


TCTTCCTTTC 


60 


ATGCTCTTTT 


ATCTCTTGCG 


TGATGGGAAA 


GGCTTGCGTA 


ACTATTTGAC 


CCAATTCATT 


120 


CCAAGCAAAT 


TGAAGGAACC 


TGTTGGACAA 


GTTCTATCAG 


ATGTGAATCA 


ACAGTTGTCC 


180 


AACTATGTTC 


GAGGGCAAGT 


GACAGTGGCT 


ATTATTGTAG 


CAGTAATGTT 


TATCATCTTC 


240 


TTCAAGATTA 


TTGGTCTACG 


CTATGCGGTT 


ACGCTGGGGG 


TTACTGCTGG 


TATTTTAAAT 


300 


CTGGTCCCTT 


ATCTTGGTAG 


CTTTCTAGCC 


ATGCTTCCTG 


CCCTAGTATT 


GGGTTTGATT 


360 


GCTGGTCCAG 


TCATGCTTTT 


GAAAGTAGTG 


ATTGTCTTTA 


TTGTAGAACA AACTATTGAA 


420 


GGCCGTTTTG 


TCTCTCCATT 


GATTTTGGGA 


AGTCAATTAA 


ACATCCACCC 


TATTAATGTT 


480 


CTCTTTGTTT 


TGTTAACTTC 


AGGATCTATG 


TTTGGTATCT 


GGGGAGTTTT 


ACTTGGTATT 


540 


CCGGTTTATG 


CCTCTGCTAA 


GGTTGTCATT 


TCAGCCATTT 


TCGAATGGTA 


TAAGGTAGTC 


600 


AGTGGTCTAT 


ATGAATTAGA 


GGGTGAGGAA 


GTCAAGAGTG 


AACAATAGTC 


AACAGATGTT 


660 


ACAGGCTTTG 


GAGGAGCAAG 


ATTTAACTAA 


GGCTGAGCAT 


TATTTCGCCA 


AAGCTTTAGA 


720 


AAATGATTCA 


AGTGATCTTC 


TGTATGAGTT 


GGCAACTTAT 


CTTGAAGGGA 


TTGGTTTCTA 


780 


TCCTCAGGCC 


AAGGAAATTT 


ACCTGAAAAT 


TGTAGAAGAT 


TTTCCAGAGG 


TTCATCTTAA 


* 840 


TCTAG 












845 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



CTATGGATTA 


ATCTTCTTCT 


ATCTTATCCT 


GTTCCTAAAT 


ATGTTGGTAG 


AAAAATGAAG 


60 


CAGTTGTTAC 


GAGTGTCTGT 


ATTGGAAAAA 


AATTTTGTCT 


ATACAATTCT 


TTAAATGATA 


120 


TGAAGACTAG 


GAATGCCAAA 


ACACTGACAT 


ATTTCGTTTA 


AAGCCTATCC 


AGTCTCTTCA 


180 


ATTGTATTTG 


TAGAAATGCT 


ATCAGTAATA 


TATTCGCGAA 


CACTATGATT 


TACAACAGTG 


240 


TGGTCGTACA 


TCTTTGTAAC 


TTTATTATAC 


AACAACCATT 


TACAGGTGTA AAGTTGAGCA 


300 


GTGTCTTTAG 


TAAATTTAGG 


GACTTGTTTT 


GTCATAGTTT 


TGATACCTGT 


ATTCGTTGTT 


360 


TCATGAGTAT 


TTGACGTTTC 


TTTTCATGTA 


GGCATTTCAA 


CAATTAGAAT 


GATATTTTTC 


420 


CGAGGATGAA 


CTTATATATT 


ATGACTTTTT 


AGAGTTATGA 


TGCCTCTTGA 


TTTGACTATC 


480 


GCTATCAATT 


TCATCCACTT 


TTACAGTTTC 


ATGCAACATA 


GCGAGAAGGC 


GGTTTCCTAT 


540 


ATGGAAGTCG 


TCTTGTTTTC 


ATATTGTCTA 


AATACGAGAT 


ATTTACTACA 


GGTTGTAGAT 


600 


ATTGCAAGCT 


CAAATGATAT 


TATTTTAGAG 


GAGGAGACAA 


GTGAAATAAT 


GCCrGGTGAG 


660 


TTAGATGAAA 


TAAAAAGTAA 


ggagttgaaa 


GTACTGGTTC 


NTTGTGCAGG 


GTCTGGAACA 


720 


AGTGCGCAAT 


TAGCCAATGC 


AATTAACGAG 


GGCGCTCNAT 


TAGCAGAAGT 


TAGAGTGATT 


780 


GCGAATTCAG 


GACCGCACGG 


AG 








802 
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(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



60 



840 

900 
928 



CTGGGACTCT CTTCGTAGAA GTCATGCAAG AATATTTTGA TCAAAAGAGG AAATCATGAA 

AAAAAGAGCA ATAGTGGCAG TCATTGTACT GCTTTTAATT GGGCTGGATC AGTTGGTCAA 120 

ATCCTATATC GTCCAGCAGA TTCCACTGGG TGAAGTGCGC TCCTGGATTC CCAATTTCGT 180 

TAGCTTGACC TACCTGCAAA ATCGAGGTGC AGCCTTTTCT ATCTTACAAG ATCAGCAGCT 240 

GTTATTCGCT GTCATTACTC TGGTTGTCGT GATAGGTGCC ATTTGGTATT TACATAAACA 300 

CATGGAGGAC TCATTCTGGA TGGTCTTGGG TTTGACTCTA ATAATCGCGG GTGGTCTTGG 360 

AAACTTTATT GACAGGGTCA GTCAGGGCTT TGTTGTGGAT ATGTTCCATC TTGACTTTAT 420 

CAACTTTGCA ATTTTCAATG TGGCAGATAA CTATCTGACG GTTGGAGTGA TTATTTTATT 480 

GATTGCAATG CTAAAAGAGG AAATAAATGG AAATTAAAAT TGAAACTGGT GGTCTGCGTT 540 

TGGATAAGGC TTTGTCAGAT TTGTCAGAAT TATCACGTAG TCTCGCGAAT GAACAAATTA 600 

AATCAGGCCA GGTCTTGGTC AATGGTCAAG TCAAGAAAGC TAAATACACA GTCCAAGAGG 660 

GTGATGTCGT CACTTACCAT GTGCCANAAC CAGAGGTATT AGAGTATGTG GCTGAGGATC 720 

TTCCGCTAGA AATAGTCTAC CAAGATGAAG ATGTGGCTGT CCTTAACAAA CCTCAGGGAA 780 
TGGTTGTGCA CCCGAGTGCT GGTCATACCA GTGGAACCCT AAGTAAATGC CCCCCATGTA 
TCATATTAAG GACTTGTCGG GTATCAATGG GGTTCTGCCT CCANGGATTG TTCACCCGTA 
TTGATAAGGA TACGTCACGT CTTCTCAT 

(2) INFORMATION FOR SEQ ID NO: 47: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

CTCTTTTACA GAGGTGTTTC CTATTTTGTG CTTCCATTTT TGGGGCAAGG AATTGTAGAT 60 
GGGGATGGCA ATCCTATCTT TTTATTGATT ATGATATTCG TTTGCTTCAT AGTTTTAGTC 120 
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TTTTTGAAAT GGTTAGACTA TGATTTCACT 
TTTCAAAAGT CTCTTACTAA GATTAACTGG 
AGTCTATCTT ACCTTGAATA TGAACAAGGT 
CTAGTGTTTT ACCTACTCTT TTTTATGGGG 
GAAAAACTTC AGGAGGAACT GAACCAAGAG 
TATAGTCGGC ATATAGAGGA ACTTTACAAG 
AACCTCTTAA CCACTTACGT TTGGGCATTG 
TGTACGACTC GGTCTTAAGG GATTCCAGTC 
GCAGATTGGT GAATATTNGT GACCGTGCCC 
AAG 
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AGATTGAGAA GGGAGTTTCT AGATACAGGT 180 

GCAATGGGGG CTTATTATCT AGTGATGCAA 240 

ATTCAATCAA CGACTGTTCG CCATCTCATC 300 

GGTATCAAGA AATTGGATAC CTATTTGAAG 360 

CAGACCTTGC GCTACAGAGA TATGGAACGC 420 

GAAATTCGGA GTTTTCGCCA TGACTACACT 480 

AAGAGAAGGA TATGGAGCAG ATAAAAGAGA 540 

AGAAATTGCA GGACAATAAA TATGACCTGG 600 

TCAAGAGTCT CCTAGCTGGA AAATTTATAA 660 

663 



(2) INFORMATION FOR SEQ ID NO: 48: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



CTAGATTAAT 


CTGATATTTC 


TTTTTGGCAA 


ACCAAATGCT 


TCCTATCAGA AAAGCTAGCA 


60 


ACAGCAACAT 


GGTAATAATG 


ATATGAATGG 


TCATTTTCTT 


CTCCTATTCT 


GCCTTTTCAA 


120 


TATCTTTTTT 


CATCTCGTCA 


ACATTGAACT 


TAGCAAACAA 


GTATTGACGG 


TCTTGGACTG 


180 


GGAAACGTTG 


GTCCAACTGG 


TCAACTGCTC 


CCACCTCGAT 


AATGCCTTCC 


TTGATGACAA 


240 


AATCCATGAC 


ATGTCCACCG 


AAGGTCAAAT 


CATCTGATAT 


GAAGTGCAGA 


TGGTAGCCTG 


300 


CCACACTGAC 


CCCATGGAAA 


ATCTCAGGCG 


TCCAGAAACC 


AACAATAGTC 


CCCGCAACAT 


360 


TGTCACGACT 


ATATTCCGGT 


TGATGGGTTG 


CGACATCAGC 


AAACTTGGTA 


TCGGGTGTTG 


420 


ACTTAGGAAT 


CATACGCACA 


TGCATATGTT 


AAAAATTCCC 


CCCCGAATCT 


TGATAGAGCG 


480 


GAAAAGATTT 


TCCCCATCAT 


AATAAGACTC 


AATTCGTTCT 


TTCAATTCCT 


TGTCTGTCAT 


540 


CTCAAAGCGC 


TGGCGAAAAA 


TGACCTCTGC 


CTGATGCGGT 


ACCACTGCAG 


CGTAAGGAAT 


600 


AAGGGCATCT 


GGTGACACTT 


CCACAATTTC 


TGGCTGGTCT 


CCTGACCCTT 


TGG 


653 



(2) INFORMATION FOR SEQ ID NO: 49: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 989 base pairs 

(B) TYPE: nucleic acid 

(C) ' STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



60 



CTACGATATC TTTGGTCTTT TGTAAGATAT GAGGTCCACC CTTATGCGCC TCAGTTGGCA 

TTTCATGCGA TTCAAGAAGT TGCCCCTCTT GATCAACCAA ACCATACTTG ATGTTGGTTC 120 

CACCGATATC AATTGCAACG TAATATGTCA TAAATACCTC CTTTTAGATT AGAGGAAGCG 180 

CTCCTTGGTT TCACGAATCA AGGCAGCAGC CGCTTCTACA ACTGGACGAT CTTCTTCAGT 240 

CACTGGTGTC AATGGTGAAC GAACAGATCC AATATTCAAG CCTTCATTGA TTTTCAAGAC 300 

TTCTTTGATG ACACCGTACA TATTTCCATG AGCAGAAGTG AGTTTACCAA TGATTGCGTT 360 

GATAGCATAC TGCAATTCAC GCGCTGTTTC TAGGTCCTTA TCCGCAATCA ACTGATTGAG 420 

TTTCAAGAAG AGTTCTGGCA TAGCACCATA AGTACCACCG ATACCAGCCC TAGCCCCCAT 480 

GAGGCGTCCT CCTAGGAACT GCTCATCAGG ACCATTAAAG ACGATATGGT CTTCTCCACC 540 

AAGGCTGACA AAGGTTTGGA TATCTTGAAC TGGCATAGAA GAGTTCTTCA CACCGATAAC 600 

ACGAGGATTT TTCAACATTT CTGTGTAAAG GCTTGGAGTC AAAGCAACCC CTGCCAATTG 660 

AGGAATGTTG TAAATCACGT AGTCTGTGTT TGGAGCTGCA GAACTGATAT CGTTCCAGTA 720 

TTTGGCAACT GAGTTATTCT GGCAAGCGGA AATAAATTGG TGGAATCCGT TGCAATAGCA 780 

TCTACTCCCA AGCTTTCAGC ATGGCGAGCA AGTTCCATAC TATCTTTAGT ATTATTGCAA 840 

GCAACATGGG CAATAATGGT CAATTTACCT TTGGCTACCG CCATGACTTC TTCCAAAATC 900 

AACTTGCGAT CTTCAACGCT TTGGTAGATA CATTCACCAG AAGAACCATT GACATAAGAC 960 
CTTGAACACC TTTATCAATG AAGTATTGA 

(2) INFORMATION FOR SEQ ID NO: 50: 



989 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1671 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 50: 

TCAAAGAGTA ACAAAGGCAC CAAATTCTCG ATAGGAACGA TTTAGCACGG TAAACTTCAT 60 

CCACTTGGGT TCACGGAACC AAACCAGCAA TAATTTCTTT GGGCACGGGT TAATAGCATT 120 

TTGGTCAACT AGGAGTAGAT AGAACACATT TCNTTCTTCG TCTATATCAA TCTTAACACC 180 

TGTTTCAGCG ATAATCTTGT CGATGGTTTC TCCACCCTTA CCGATGACAA TCTTAATCTT 240 

GTCCACATCA ATCTTGATCG TATCAATTTT CGGAGCAGTT GGAGCCAATT CTGGACGAAC 300 

TTCTGGAATG GTTGCTTCAA TGACATCAAG GATTTCAAAA CGCGCTTTCT TGGCTTGAGC 360 

AAGAGCCTCC GTCAAGATTT CTGCAGTAAT CCCTTGAATC TTGATATCCA TTTGAAGGGC 420 

TGTAATCCCA TCACGAGTAC CTGCAACCTT GAAGTCCATA TCTCCAAAGT GATCTTCCAA 480 

ACCTTGGATA TCTGTCAATA CTGTCTAGTT ATTTCCATCT GAGATAAGTC CCATAGCAAT 540 

ACCAGCTACT GGCGCCTTGA TTGGCACACC ACCAGCCATA AGGGCAAGAG TTCCCGCACA 600 

GATAGAAGCT TGAGATGAAG AACCGTTTCA TTCCAAAACT TCTGCTACTA GACGGATAGC 660 
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GTATGGGAAT TCTTCCAAGC TTGGCAAGAC 
GTGACCGATT TCACGACGAC CTGGCGCACC 
AGGGAAGTTA TAGTGGTGCA TAAAGCGTTT 
TTGAGTTTCT CCCATCGGAG CCAAGGTCAA 
AAAGAGACCT GAACCATGTT ACACGAGGAA 
TTTCATCGAC CTTACGACCA TCAGGACGCA 
CTGCGTGTTC CATTTGTTCC AAGATTTCAG 
CGTGGTCCGC ATATTTTTCT TCGTAAACGG 
CAGTTCACGG GAATCCAATT TCTCTTCTAC 
GGCTGCAATG ATTTCAGCTT GCAATTCAGC 
TTCTTTACCG ACAGCAGCAA CGATTTCTTC 
GTGCCCTTTA AGAAGCGCTT CCAACATGAT 
TACCATGTTG ATAGCGTGCT TGGTTCCAGC 
TTGTTCTTGA CTTGGGTTGA TGATGATTTG 
AATTGGTCCG TCAAATGGAA TATCTGAAAT 
AGCCATTGGT GCAGATGCAT TTTCATCATA 
ATTACGGAAA CCTTCCGCAA ACATAGGACG 
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TTGAGCAAGA GCACGCTCAC CAAGGGCACC 720 

GTAACGACCT GTTTCCCCTA CAGAATATTG 780 

CTTGTACTCT GGATCCAAAC CATCAATGAT 840 

GACTGAAAGA GCTTGAGTTT GCCCACGAGT 900 

GGAAGTCAAC AACCGCATCC AAAGGACGGA 960 

CCTTGTCTTC TGTAATTAAA CGTCGCACTT 1020 

CCACATCACG CATAATACGG TCAAATTCTT 1080 

CAGTCACTTG GTCTTTCACT GCTTGAGTTG 1140 

TTGAACTGCC TTTTGGAGGT CACTGTTGTA 1200 

ATCCACGTGA AGCAATTCCA CTTCTGCTTT 1260 

TTGGAAGGCA ATCAATTCTT TGACAGCTTC 1320 

TTCTTCTGAC AATTCTTTGG CACCAGACTC 1380 

TACTGTCAAT TCAAGAAGAG ATTGCTCTGC 1440 

GCCATCTACA TATCCCACTT GTACCCCAGC 1500 

AGACAGTGCC AAAGATGAAC CAAACATAGC 1560 

AGAAAGCACT GTATTGATGA CTTGGACTTC 1620 

AATCGGACGG TCAATCAAAC G 1671 



(2) INFORMATION FOR SEQ ID NO: 51: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 731 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 



CTCAGTATCT 


TTGTAATCAA 


CATATCCAAT 


TTTGTTTGCT 


GCGATGTANT 


CAACTTTTTT 


60 


ACGGCGTTTG 


AATCCGCCAC 


GACGTNGTTG 


AGCCATGTTT 


TTNCTCCTTT 


ATAAGTTTAG 


120 


TTGTCCATTA 


GAATGGTAAA 


TCATCAWCTG 


AAATATCCAA 


TGGGTTTGTT 


GCTCCAAATG 


180 


GATTTTCATT 


ACGTGAAAAG 


TCTGGTACTG 


AATTTGTAGG 


TGCTGAATAG 


TTTGCAGTTG 


240 


GTGCAGAGTA AGCTCCACCT GTGTGACCCT 


CACGCACACT ACGGCTTTCC AACATTTGGA 


300 


AATTCTCAGC 


CACGACCTCT 


GTCACGTAGA 


CACGTTGTCC 


TTGCTGGTTA 


TCGTAACTAC 


360 


GAGTCTGGAT 


ACGACCTGTC 


ACCCCGATAA 


GTGAGCCTTT 


TTTAGCCCAG 


TTAGCAAGAT 


420 


TTTCAGCCTG 


TTGGCGCCAC 


ATAACGACAT 


TAATAAAATC 


AGCCTCACGT 


TCACCATTTT 


480 


GACTCTTAAA 


TGTACGGTTT 


ACTGCAAGAG 


TAAAAGTCGC 


AACTGCTACA 


TTTGATGGGG 


540 


TATAACGCAA 


CTCAGCGTCA 


CGTGTCATAC 


GCCCTACAAG 


TACAACATTG 


TTAATCATAG 


600 


TTTACCTTCT 


TACGCGTCAA 


TTTTGACGAT 


CATGTGACGA 


AGAATGTCAG 


CGTTGATTTT 


660 


GGAAAGACGG 


TCAAACTCTT 


TAGAGAGCTG 


CATCGNTCAT 


TTGCTTCAAC 


GTTAACGATG 


720 


TGGTAAAGTC 


C 










731 



87 



WO 97/37026 

(2) INFORMATION FOR SEQ ID NO: 52: 



PCT/US97/05306 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 992 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

CTGGGTTTTA CCACGTTTTC CTGATGTACG AGCGAGATCC TTACGGTTCA ACATAGTGTT 60 
GATAAAGCTG GATTTACCAA CATTTGAACG CCCTGCTAGG GCAATCTCTG GCAGTTCATC 120 
CTGCGGATAG TGGGACTTAT TAGCTGCACT GAGCAAGATT TCAGCATTGT GTGTATTAAG 180 
TTCCATAGTC ACCTCTAGGC TGTTTCTAGG ATCGGTTTAT CCGTTCCATC GACAGTTTCT 240 
TTAGTGATGC GAACCAATTT CACATTTTCC TGACTCGGCA CCTCAAACAT AACATCTAGC 300 
ATGGTTTCTT CGATGATGGA GCGAAGTCCA CGCGCCCCTG TCTTCCGTTC GATTGCTTTA 360 
TTAGCAATCT CTTGAAGGGC TTCGTCGTCA AATTCCAACT CAACATCATC ATAAGAAAGC 420 
AAGGTTTGGT ATTGTTTCAC CAAGGCATTT CTTGGCTCTT TCAAGATGCG AACCAAGTCC 480 
ATCAACGGAC AATTGCTCAA GAGCCGCAAA AACAGGCAAG CGTCCAATCA ACTCAGGGAT 540 
AATACCAAAT TTTTGAATGT CTTCAGCGAT GAATTCTTGC ATGTATGAGC TGTTTCTCTG 600 
TCAATCGCCT TATTGTTTTG ACCAAATCCG ATGACTTTTT CACCCAGACG TTGTTTGACA 
ATTTCTTCAA TACCATCAAA AGCACCACCC ACGATGAAGA GGATATTTTT TGTATCCACT 
TGAATCATCT ACTTGTTGTG GATGTTTGCG TCCACCTTGA GGCGGCACGC TAGCAACAGT 
TCCCTCAATA ATCTTGAGAA GGGCTTGTTG CACCCCTTCA CCAGAAACAT CACGTGTGAT 
AGACACATTC TCACTCTTCT TGGCAATCTT GTCAATTTCA TCCACATAGA TAATGCCACG 
CTCTGCACGT TCGATGTTAA AGTCAGCAAC CTGCAAGAGT TTGAGGAGTA TATTTTCCAC 
ATCCTCACCC ACATAACCAG CCTCCGTCAG AG 

(2) INFORMATION FOR SEQ ID NO: 53: 

( i ) SEQUENCE ' CHARACTERI STIC S : 
(A) LENGTH: 1299 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 
{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



ATTTTCACAG GGCAGGATGC GGAACGACTA GCCCCATACT TTAACGGAAT TTTGGTAGGG 
ACAGCTCTTA TGCAGGCAGA AAATGTGGCC CAGAGAATCA AGGAGTTGCA GATTGACAAA 



660 
720 
780 
840 
900 
960 
992 



60 
120 
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GGTTAAAATT 


TGTGGACTAT 


CGACCAAAGA 


AGCGGTGGAA ACAGCCGTTT 


CAGCAGGAGC 


180 


CGACTATATC 


GGTTTTGTCT 


TTGCACCTAG 


TAAAAGACAG GTGACTTTAG 


AAGAGGCAGC 


240 


TGAGTTGGCA 


AAGCTTATTC 


CTGCAGATGT 


GAAAAAGGTT GGAGTATTTG 


TTTCACCAAG 


300 


TCGGGTAGAA 


CTGCTGGAAG 


CGATTGACAA 


AGTTGGCTTG GACTTGGTTC AAGTTCACGG 


360 


TCAGGTGGCA 


GATGATTTAT 


TTGAGAATTT 


GCCTTGTGCC AGTATTCAGG 


CTGTGCAGGT 


420 


AGATGGAAAT 


GGGCATGTCC 


CCAATTCTCA 


GGCAGATTAT CTACTCTTTG 


ATGCCCCTGT 


480 


GGCAGGAAGT 


GGCCAGTCCT 


TTGATTGGGG 


TCAACTGGAT ACGACTGGAC 


TAGCACAGCC 


540 


CTTCTTTATC 


GCAGGTGGCC 


TTAATGAAGA 


TAATGTAGTA AAAGCAATTC AACATTTTAC 


600 


TCCCTATGCA 


GTAGATGTAT 


CGAGCGGAGT 


GGAGACAGAT GGACAAAAAG 


ATCATGAAAA 


660 


GATTAGAAGA 


TTTATAGAGA 


GGGTAAAGCA 


TGGCATATCA GGAACCAAAT 


AAAGATGGAT 


720 


TTTACGGAAA 


ATTCGGCGGA 


CGTTTTGTCC 


CAGAAACATT GATGACAGCA GTTTTGGAGT 


780 


TGGAGAAGGC 


CTACCGTGAA 


AGTCAGGCAG 


ACCCAAGTTT CCAAGAGGAA 


TTAAACCAAC 


840 


TCTTGCGCCA 


GTACGTAGGA 


CGTGAAACTC 


CTCTTTACTA CGCAAAAAAC 


TTGACCCAGC 


900 


ATATCGGCGG 


AGCCAAGATT 


TATCTCAAAC 


GGGAAGACCT TAACCATACA 


GGAGCCCACA 


960 


AGATTAACAA 


TGCCTTAGGA 


CAAGTTTGGC 


TTGCCAAACG CATGGGTAAA 


AAGAAAATTA 


1020 


TCGCAGAAAC 


GGGTGCTGGT 


CAGCACGGTG 


TGGCAACTGC AACTGCTGCG 


GCCCTCTTTA 


1080 


ACATGGAATG 


TACCATCTAC 


ATGGGTGAGG 


AAGATGTCAA ACGCCAAGCC 


CTCAATGTGT 


1140 


TCCGTATGGA 


GCTTTTGGGA 


GCTAAGGTTG 


AAGCAGTGAC AGATGGTTCG 


CGCCTGCTCA 


1200 


AGGATGCGGT 


CAATGCAGCC 


CTTCGTTCAT 


GGGTGGCTAA TATCGACGAT 


ACCCACTATA 


1260 


TCCTTGGTTC 


TGCCTTGGGG 


CCTCATCCAT 


TTCCAGAAA 




1299 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1078 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: , 54: 



CTGTTGAATC 


AACAACACGT 


ATCCAAGTAT 


CACNATCAAA 


TTATGCAGGT 


AATCGTACAA 


60 


TTGGAAATCA 


CCGTGGATGG 


TTCAATCCAA 


CAACAACTTC 


TGAAGGTTTT 


GTTACATATA 


120 


TTTATGCAGA 


TTAATTTACA 


GAGGGACTCG 


AATAGAGCCC 


TCTTTTCAGG 


TTTTACCGTG 


180 


ACAATCCCTA 


TTAAAAATTA 


TATCAAAATA 


GCTTGAAAAT 


ATTGGAAAAG 


TATGGTAGAA 


240 


TGAAAATTGT 


CGTGTGAACG 


ATAATACTCA 


TTCTTGATGA 


ATTGTGAAGC 


AGTTGCCCTT 


300 


GGGTCGTTTT 


GCGAGTTGAA 


GTCAAGAAGA 


GGAAAAAAAC 


AAAAAGGAGA 


AATACTCATG 


360 


GCAGTAATTT 


CAATGAAACA 


ACTTCTTGAG 


GCTGGTGTAC 


ACTTTGGTCA 


CCAAACTCGT 


420 


CGCTGGAATC 


CTAAGATGGC 


TAAGTACATC 


TTTACTGAAC 


GTAACGGAAT 


CCACGTTATC 


48C 


GACTTGCAAC 


AAACTGTAAA 


ATACGCTGAC 


CAAGCATACN 


ACTTCATGCG 


TGATGCAGCA 


540 


GCTAACGATG 


CAGTTGTATT 


GTTCGTTGGT 


ACTAAGAAAA 


CAAGCAGCTG 


ATGCAGTTGC 


600 


TGAAGAAGCA 


GTACGTTCAG 


GTCAATACTT 


CATCAACCAC 


CGTTGGTTGG 


GTGGAACTCT 


660 
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TACAAACTGG GGAACAATCC AAAAACGTAT 
GAAGAAGATG GAACTTTCGA AGTTCTTCCT 
CGTGCGCGTC TTGAAAAATT CTTGGGCGGT 
GATGTACGTA GTTGACCCAC ATAAAGAGCA 
GAATCCCAGT TGTAGCGATG GTTGACACCA 
TCCCAGCTAA CGATGACGCT ATCCGTGCTG 
CTATTATCGA AGGACGTCAA GGTGAGGATG 
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CGCTCGTTTG AAAGAAATTA AACGTGGATG 720 

AAGAAAGAAG TTGCACTTCT TAACAAACAA 780 

ATCCGAAGAT ATGCCTCGTA CCCCAGATGT 840 

AATCGCATGT TAAAGAAGCT AAAAAATTGG 900 

ATACTGATCC AGATGATATC GATGTAATCA 960 

TTAAATTGAT CACAGCTAAA TTGGCTGACG 1020 

CAGTAGCCGT TTGAAGCAGA ATTTGCAG 1078 



(2) INFORMATION FOR SEQ ID NO: 55: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

CTATGAGGGA TATAAAGTGA GAGAGTTGGG CCAACGTTCC AAAAACGTCC TAAATTCTGC 
CCATTGACAA AGGCAACCCC CTTACCAAAC TCAGACNAGT CTAGGTAAGT ATCTTTTGGC 
TATTCGACTG TAAAGTCATA AGCGTAAAAG GCTGGTTGTC CTTGAGTCCA TCCTTTGGAA 
AAATCAATTT TCTCAGGATT GTCTAGTGGG AGTGGATAGT GTTTCCAGTT TAGTAAGAAA 
TGCAGATCCT TACAGACCCC TGTCCGAATT CCCTTACGTT GCGTATCCGC TAAGAACTTA 
TGCCCATAGT TCACACGCCC CATATTTTCT ATCAAGATAT CTAACCTAGA TAGCCCTTTC 
TTTTTACCTT GATAAAAAAT ATCTTCCCCA ATCTCTGTCT GATATTGAGT TTTAACCCAC 
TGACCATCGA CATACAGCTG GGCCCTATCT CGACCATCAA TGATACGAAG TCTTTCTTCT 
TCTGCATCCC AGTTTGTTTC TGTTCGATAA AGTAGGTAGC CATAACTTTG TCCCAGCTCC 
TCCATCTTTT GAGGATAGAG ACTTTCTACA GGGACTTGAC AAGCTATCTA AGGTTTCAAA 
CAAAGAAACT TTTTCAACTA GTGGAATAGC ATCCAACTCC ATACTCTCTT TGTAGAGTGG 
TTCCAACTGC GGATACTCTG AAAAATGTGT TGCCATCATC TTCTTGACTG CAAGATATTT 
AGCAGTTGGA TTTCCTTCTT CATCCAGAAG GGCATCGTAA TCATAAGACG TTAACTTGTG 
GCAGGTCCAA ATTCCTCGAG 

(2) INFORMATION FOR SEQ ID NO: 56: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
800 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 769 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYFE: Genomic DMA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 



CTCACGCAGT 


TCTGTTACCA 


AAGACCGTTC 


AAAAAGCAAA 


CTAGCAATAC 


a lAinAvi in 


o \j 


TTAGAAATTC 


CTAATTTTAC 


CTTGCTAAAC 


GAATACAGGT 


CACGATGCCA 


TCTGAGTCTG 




TGATTTCCAA 


TTGGCTCGAT 


GTCATCCATT 


TGATTGGTAC 


GTCAACTTCT 


TGTGCTCGTT 


ion 
lbU 


GGGC AATCGT 


TAACAGTTCT 


TCTTTATGTG 

A \» AAA F\ A VJ A W 


CAACTTCGAT 


GACATAGTAG 


GCTAAACCTG 


240 






» c*c*r* "hr*T^ r*r*rn 


TTTCCTTCCC 


CATTCGTTAA 


CTGCTAAATG 


300 


ATGATGGTAA 


TCTCCAGCCG 


CAATCCAACT 


AGCGCCAGGC 


ACACTGAATT 


TATTCTCTAG 


360 


CCCTAACACC 


TTTTGATAAA 


ACTGGCTGGA 


CTTTCGACTA 


TCCTTGACGG 


AAAGATGAAT 


420 


ATGCCCCATT 


CTTGTACCTT 


CTGCCAGGAT 


AAAGGGCTCT 


ACTCTTTCCC 


CCAACTCATA 


480 


AATGTCCTGC 


GCCGCAAGAG 


CCTCCGTCAC 


TCCGATAATG 


CGTCCATCTT 


CTCGAATATC 


540 


CCATGTGGAA 


ACTGGCTTAT 


CTCGATAGAG 


TTCAATGCCA 


TTTCCCTCCA AATCTTCCAA 


600 


GTAAATAGCT 


TCACTGTAAC 


CATGGTCTGC 


ACCGCCGACA 


AGAGGAATTT 


GCAAATCTGT 


660 


CAGATGTTTC 


AAGACATCAG 


CCAAAGCCTT 


GCGTGTGGGC 


AAGAGAATGG 


CCAAATGGTA 


720 


AAGACCATAA 


TGTTCCCTCA 


CTTCTCCGCT 


CTTCTTGTGC 


TTGAATCAG 




769 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 637 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 



TTACCCCAAG 


CCTGCAACTT 


CCCAAGCTGA 


TTGGTTGAAT 


GATTTGTCAT 


CAACACCACC 


60 


AGTATCAGTG 


ACGATTGCTG 


CTTTTGTCTT 


CACATCAGAA 


GATGAAGCTG 


CGTTACGAGA 


120 


AGAGCGGTTA 


CCACATGCAG 


CAAGTCCAAC 


TGCTGCCACT 


GCAACTAGGC 


CAAGACCTAG 


180 


CCATTGTTTC 


TTGTTCATTA 


CTGAACCTCC 


TAAATAAGAT 


GTGCAACGAT 


GTTGCAAGTA 


240 


TGGATTGATT 


GGTCACAAAG 


ACCCTTGCCA 


CTCAAAGAGC 


GACTCAGACT 


AATTTAAGTC 


300 


TGTAAAAGAA 


TATGGAAGTA 


ACTCCCCGAC 


CGTCATCTCG 


ACCGTCGATT 


TATCTTTTGC 


360 


GACTAAGGTC 


ACTTTTAAAT 


CTTGTTCAAA 


AAATTCGACC 


ATTACTTGGC 


GACAAGCACC 


420 


ACATGGTGAA 


ATCGGTTTTT 


CAGTTTGACC 


ATAGACAATC 


AATTCTGAAA 


ATTCTCTTTG 


480 


GCCTTCAGAT 


ATAGCCTTAA 


AAATAGCTGT 


TCTCTCACCG 


CAATTGGTCC 


AAAGGATAGC 


540 


TAGCATTTTC 


GATATTCACT 


CCCGTGTTAA 


ACATTTCCGT 


CTTTGGCTAC 


TAAAACTGCT 


600 


CCGATAGGAA AGTGAGAATA 


GGGGACATAG 


GCATGTT 






637 



(2) INFORMATION FOR SEQ ID NO: 58: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 940 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 



CTGAGGATAA 


TGACATCTGC 


CANAGGAGCA 


AAACTAGCAA 


AATCATCNGT 


TGCACGGTCA 


60 


ATCATGCCTT 


CTTTCAAGGC 


GATATCTCTC 


GAAGCTTGAC 


TACGGATTAT 


AACCTAAAAT 


120 


TTCATAATCT 


GGATGATCGC 


GTTTGATACC 


AAGTGCCATA 


GAGGCTCCAA 


TCAACCCAAG 


180 


ACCTGCGATA 


TAGATTGTTT 


TTGCCATAGG 


AACTCCTTAA 


TAGTTCTTTG 


TATAGTCTCG 


240 


GTGTTTGGCT 


ACCGCTTCTT 


TTAGTTCCTC 


AAGATTATCT 


GATGAGAATT 


TTTCGAGGAT 


300 


TTCTTGCGCC 


AGAACCGTTG 


CTACAACTGC 


TTCCATGACC 


ATTCCTGCAG 


CTGGAAGAGC 


360 


AGTCGGATCA 


CTTCTCTCCA 


CGGTTGCCTT 


GTAAGGTTCG 


TGGGTTTCGA 


TATCCACACT 


420 


CATAAGAGGT 


TTATAAAGAG 


TAGGAATGGG 


TTTCATGACC 


CCACGAACAA 


CGATGGGTTG 


480 


CCCATTAGTC 


ATACCACCTT 


CAAAACCACC 


TAGATTATTG 


GTACGGCGAG 


TATAACCGTC 


540 


TTCTTTAGAC 


CAGAGAATTT 


CATCCATAAC 


TTGGCTGCCT 


TTACGATAAC 


CAGCCTCAAA 


600 


GCCAAGACCA AATTCCACCC 


CTTTAAAGGC 


ATTGATAGAG 


ACAACAGCTT 


GAGCCAATCT 


660 


TGCATCCAAT 


TTTCTATCCC 


ATTGGGACAT 


AGGAACCAAG 


ACCAACTGGG 


AACGCCTCCG 


720 


ACGACTGTCT 


CCACAACCCC 


ACCGATGGTA 


TCACCATCAC 


GTTTGATTTG 


GTCAATATAG 


780 


TCCTTGATTT 


CCTGTTCTCG 


TTCTTGGTTG 


ACAATAGAAA 


ACTTCCAGAC 


TGGGCAGCTC 


840 


TTTGCTTTAA 


TTTCCAGCGA 


CTGTCAGATT 


TTCCAGGAAC 


ATCCATTCCC 


TTGCCACCAA 


900 


AGACCACGAC 


ATGGTTGGCA 


ATCTCCATAT 


CCAGATTCGA 






940 



(2) INFORMATION FOR SEQ ID NO: 59: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 



GATTCTACAG 


GATCCATTTT 


ACTATCTTAC 


GCGCCGGGAA 


GTAGGCTGAG 


ACATAACCAA 


60 


GTAATAGAGC 


CAACAACTAG 


AGTTCCTAAA 


ACAGATAAAA 


GATTTAATTC 


AAAAACCTTA 


120 


GTGATGGATG 


GGTAAAAGTG 


ACTTACAATC 


GCATTCGCCA 


AACTTCCCAC 


CCCTTGTGCA 


180 


ACCAAAAATG CCAGCAGCAA GGCGATGCCT 


ACAATCCAGA 


TAGCCTCGTA 


AATAAAAATT 


240 


CCTTTGACAT 


CACGATTCTG 


ATAACCAACT 


GCTTTCATGA 


CACCTATTTC 


CTTGGAACGT 


300 


TGCATGATAT 


TGATGTAAAT 


AATGATACCA 


ATCATAACCG 


CTGCTACCAC 


AATAGCTTGT 


360 
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GATGAAAGCA CAATCAATAA TCCCTGAATA ACACGAATAA AGGTAATCAC AATATCAAGA 
ACTCTCTGTT AAGAAAGCAC AGTATACTTC TTATTTTTCT GTAATTCTTC TGTTACTACT 
TTTGTCTGTG ATGGATCTTT GAGTTCCAAG ATAAAATAAG ATACAGCTTT CGTAAATCCA 
GCCTCTTTCA AAATCGTTTC CATTTGATGA GACAGCATGA AACTGTTGCT GTCCTCCATG 
TCATCTTCAT CATTGATTAC ACGTACAATC TTCGTTTGAA ATTGAGCAAT CTTACTAGTT 
TCGGCAGCAC TTTCTACAAT GCTGACTGAG ACTGATTTGC CAATAAGATC ATTAGCTGTC 
AAATTTTTTC CTGTCTGTTC ATTCCAATTT TTTAGTAAAG CTGCTTGGAA TCGTTAATCC 
CTGTTCATTT GTATCAGTAT AGAGGGATCC AGCCAACACT TTGTCCGTCT CATTATTACT 
AACAGAGATA CTTGTATCAT CATAAAGACT CACTACTTGA GCATAAGAAG CATCGTTTGA 
CTCAAATCCA TTTCTTGCCC ATCTTTTCTT GCCCATCTAT AGTAATATTT GACATGTTCA 
TCCCNAAAGG ACTCTCCAAA TATNNNATAG 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1253 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


SO: 






ATCGGCTAAG 


ATGCGTGGAG 


CATCACAAAT 


TATCCTTATG 


AGCCGTCATG 


AAGACGTCAA 


60 


AAGATGGCTA 


TGGAGTCAGG 


TGCGACAGCT 


GTTGTTGCAG 


AACGTGGTCA 


AGAACGAATT 


120 


ACCAAGGTGC 


GTGAAATCCT 


CGGCGGAGGA 


GCAGACGCAG 


CACTTGAATG 


TGTTGGTACG 


180 


GAGGCTGCTA 


TAGAACAGGC 


GCTAGGTGTT 


CTTCATAATG 


GAGGGCGTAT 


GGGCTTTGTA 


240 


GGAGTCCCAC 


ACTATAATAA 


TCGTGCTCTT 


GGTTCGACAT 


TTATGCAAAA 


TATCTCTGTA 


300 


GCAGGTGGGG 


CAGCTTCTGC 


TACAACATAC 


GATAAGCAAT 


TTTTACTAAA 


AGCCGTCCTT 


360 


GATGGTGATA 


TCAATCCAGG 


TCGCGTCTTT 


ACTTCAAGTT 


ATAAACTGGA 


AGATATCGAC 


420 


CAAGCCTACA 


AAGATATGGA 


TGAACGTAAG 


ACCATTAAGT 


CTATGATTGT 


AATCGAATAA 


480 


AAAACGAATA 


GGAGTTTTAG 


AACTCTATTC 


GTTTTTTATG 


TTATCCTATT 


CTTGATTTAA 


540 


GGTACTTTCT 


CTTAATGTCA 


GTCTGGTTCC 


CAGCATGGTC 


AGGCTAGGGA 


TTTTCCGACC 


600 


GTGGAGGACT 


TCCTTGTTAA 


GAATATCCAT 


ACCTGCTCGG 


CCCATTTCTT 


CAGTATAAAC 


660 


TGTAATACTA 


AGAGAGGGGA 


GGATAGACCT 


GTTTGGTCAG 


ACTAAGTGTC 


GTTAAAAGGA 


720 


AAATGAGGCT 


GACGCGATCT 


GGCAGGCTGA 


TTCCAGCTTC 


TTGGAGGGCA 


CGGAGGGCAC 


780 


CGATAGCTAA 


ACTATCGCTG 


GCTGCGAAAA 


ATGCTGGCGG 


AAGTTGGTCT 


CCCAAGCTCT 


840 


GAATGGCCTC 


CTTCATTAAG 


TCATAGCCAG 


ACTGGGCAGT 


AAATCTTCCT 


TGAAAGACCA 


900 


GTTCATCATG 


ATAGATTCCC 


CTCGCTTGAC 


TGTAGTTTTT 


GAAGTTTTCT 


AGACGCTTGT 


960 


CCTGAATGAT 


TTCTTCTTGG 


TCTGTAGiTT 


CTTCAAGGCC 


TGTTAGAATC 


CCGATACGGT 


1020 


CCATTCCTTG 


ACTGAGGAAA 


TAATCGACAA 


CCTGTTTCAT 


AGCAGTGTAA 


AAATCCGTGA 


1080 


TAATACAGGT 


ATGTCCCAGG 


GAAAGTGTAT 


CGCTGTCTAG 


AAATACAAGA 


GGCTTTTGGT 


1140 


ATTCTTCAAA 


GGCAGA^ATC 


TGAGCTTCTT 


GATAGGATTG 


ATTCTGCTAG 


GGATAACAGC 


1200 
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CAAAGGCAAT CCTCCCTTTA TCCACCTCAT ATTTGAAACC ATTTCAGCTC TTA 1253 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



CGTGTACCAT 


CAAGAGGCTA 


ATTGATGTTT 


CTTCCCCCTG 


TGCCTCTCTT 


GTCAACATTT 


60 


CTGTTTGAGC 


CTTATGCGCC 


TGTAATAAGG 


CAGCATTCGC 


ATCAGCTAAT 


CTTCGATTGG 


120 


CTTTAGAGAA 


ATCCCCTTTT 


TTTGCTGCCT 


GAATAGCCTC 


CATAGCATTA 


CTTTTGGCTT 


180 


CCCCACCATA 


CATAATTAGC 


CCCATTACAG 


ATTCTAAATT 


ACTTTCATCC 


ATAACATTCT 


240 


CCTCAATCTT 


AGTCTAATAA 


ACTTTCAGCT AAATCTAAGA 


CTTTAGAACC 


ATTCATCATT 


300 


CCATAATCTG 


TCATCGGAAT 


AACCGCCACA 


GGAATCTGTC 


TATCTTTTAG 


TTTTTCTTGA 


360 


AAGTCCCCTA 


GTAAATAGCG 


AACTTGAGGA 


CCTAAAAGCA 


ACACATTTAC 


TTCTTTTGTT 


420 


GCTACAATTT 


CTTCTGCTTC 


AGGAGCTGGA 


ACTGCAAAAA 


TTTCTGCATC 


CAACCCCTTA 


480 


TCTTCTGCTG 


CCTTTTGCAT 


CTTTGTCACT 


AGCAGACTTG 


TACTCATACC 


TGCTGCACAT 


54 0 


GCTAACATAA 


TTGTTACTTT 


AGCCATAGCT 


TACTCCTTTA 


TCACCTATTT 


ACTGACCTAG 


600 


CCAAGCGAGA 


TACTGTATGT 


CTCACTCCTC 


TAATTGCCTT 


GCTCAATGCA 


TAAATATTAT 


660 


CGACTGTTGC 


CAGTCCCCCA 


TATCCTGCGT 


CACCTATATG 


TTGAATATCA 


ACTCCACAAA 


720 


TTTTATTTCT 


AAGTGCAATT 


TCCTTAATAG 


TATCTGTATC 


AGATGTTTCT 


TGGCTAGTAC 


780 


CAATAGCACT 


CAATACAAGA 


CCACCCTTAC 


TATGAACGAG 


ATCAACGACT 


TCACGCAACT 


840 


CTTGGTCATG 


AAAGGCTGGA 


ACGGTTCCAA 


CTGCTGGAAC 


AAGTATCACA 


TCCGCACCTG 


900 


CTTCCAACAA 


TTGCTCTGCG 


ACAGAAAGCT 


CTGCCACAGG 


CTCATTCACT 


CCTGCCCCGT 


960 


GCATCTTACC 


TGCAATAATC 


AAACCAGAAA 


AGTTTTCCTT 


AGCAGTTTGA 


ACAGCCTTAA 


1020 


TGATTTCTCG 


ATTGCTAACT 


CCTGTTCCAG 


GATTTCCAGT 


CAAACAGACA 


AAGTCAAAAC 


1080 


CTAACTCCTA 


TATTCGCTTC 


AATGTTTCAA 


CACTAGCAAC 


ACGACCTGCA 


ACAATTTCCT 


1140 


GTGTTTCCTC 


TAGCATTCTT 


TGCAGATGGG 


TCAATCGGTT 


CCAAATTTAC 


ACCAATTGGA 


1200 


CAAGCAACAA 


G 
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(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DMA 



PCIYUS97/05306 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 



60 



CCACACTAGC AGCCTATGGA CTCACTAAAA GAAAAGAAGA CTAAGTCTTT TCGATAAAAA ou 

ATAAACAGCG AGATTGAAGC TCGCTGTTTA TTTTTTAATT AATCACCTAG TCCAAGACGT 120 

TCAAAGATAT CATCCACTCG TTTGGTGTAA TAAACTGGGT TGAAGATTTC ATCGATTTCT 180 

TCTTGTGTGA GACGTGATGT TACTTCTGAA TCTGCCTCAA GAAGTGGTTT AAAGTCTACT 240 

TGGTTGTCCC AAGAGTAGGC TGTTTTTGGT TGCACCAAGT CATAGGCTTG CTCACGGGTC 300 

ATGCCTTTTT CAATCAATGT CAACATAGCC CGTTGGCTAA AGATAAGACC AAAAGTCGAG 360 

TTCATGTTTC GGATCATATT TTCTGGGAAG ACTGTCAAGT TCTTGACGAT ATTTCCAAAA 420 

CGGTTGAGCA TGTAGTCAAT CAAAATGGTC GTATCTGGTG TGATGATACG CTCAGCTGAT 480 

GAGTGAGAAA TATCGCGTTC GTGCCAGAGA GCGACGTTTT CATAAGCCGT AATCATGTGA 540 

CCACGAATGA CACGCGCCAG ACCAGTCATA TTTTCAGAAC CGATTGGGTT GCGTTTGTGA 600 

GGCATTGCTG AAGACCCTTT TTGCCCTTTA GCAAAGAACT CTTCTACTTC GCGTTGCTCA 660 

GATTTTTGTA GACCACGAAT CTCAGTCGCC ATACGTTCGA TTGAAGTCGC AATGCTGGCA 720 

AGAACCGCAA AGTACTCAGC GTGAAGGTCA CGAGGAAGGA CTTGTGTTAA AGATTCCTTG 780 

GGCACGGATG CCAAGTTTAT CGCAGACATA CTCCTCTACA AATGGTGGGA TATTGGCAAA 840 

GTTCCCAACC GCACCAGAAA TCTTACCAGC TTCTACACCA GCAGCCGCAT GCTCGAAGCG 900 

■ CTCCGATATT GCGTTTCATT TCGCTGTACC AAGTTGCTAA TTTAAGGACC AAAGGTTGTC 960 

GGCTCAGCGT GCACACCATG AGTTACGCCC CATCATGATG GTGAACTTGT GCTCCTTGGC 1020 

TTGTCAGCGA TGATATTAGT GAAGTTTTCA AGGTCACGAC GGATGATGTC GTTGGCCTGC 1080 

TTGTAGAGGT AACCATAAGC AGTATCCACC ACGTCGGTAG AAGTTAACCC ATAGTGAACC 1140 

CACTTGCGCT CTTCACCAAG AGTCTCAGAA ACCGCACGCG TGAAAGCCAC CACATCGTGG 1200 

CGCGTCTCCT GCTCAATTTC CAAAATACGG TCGATGTCAA AGTCCGCCTT CTTGCGAATC 1260 

AAAGCCACAT CTTCCTTAGG GATTTCCCCA ACTCAGCCCA TGCCTCGTCA GAGAGGATTT 1320 

CCACCTCAAG CCAAGCACGG TATTTATTTT CTTCACTCCA AATATTCGCC ATCTCAGGGC 1380 

GAGAGTAACG GTTGATCATG TGTTAATTTT TCCTTTCTTC TTAAG 1425 

(2) INFORMATION FOR SEQ ID NO: 63: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1124 base pairs 

(B) TYPBr nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 



CTATTCTGAG TATAAAGTTG GCGGAGGCTG GAACTACGCT CGTTATGAGG TCATAAACTA 
CTATACPGGA GGTTATTAAT CCTTAAAGAG TGAGAAAAGG AGGGCTGGAT ATGTTAAATC 
TTACTCATGT TACCTTAAAA ACGCGACAAG TCATCTTGCA AGATGCGGAT TTTACCTTTA 
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AAAAGGGTAG GATTTATGGC CTTCTTGCTA TCAATGGCTC GGGAAAGACG ACACTATTCC 240 

GAGCTATGAG CAAGTTGCTT CCCCTTAGTA GTGGACACAT CGCAGTTCCT CCTTCTTTGT 300 

TTTATTATGA GAGCGTTGAA TGGCTGGATG GAAACTTAAG TGGGATGGAC TACCTTCGTC 360 

TCATAAAAAA ACATCTGGAA GTCAGACCTA AACTTGAGAG ATGAAATCGC CTACTGGGAA 420 

ATGGCTGACT ATATCAGTCT TCCCATCCGC AAGTATTCCT TAGGGATGAA GCAACGCTTG 480 

GTGATTGCTA TGTATTTTCT CAGTCAGGCC AAATGCTGGC TCATGGATGA GATTACAAAT 540 

GGCTTAGACG AGTATTATAG ACAGAAGTTT TTTGATAGGC TAGCACAAAT CGATAGACAA 600 

GAACAGCTGG TTCTTTTAAG TTCCCACTAT AAGGAAGAGT TGGTTGATAT CTGCGATAGA 660 

GTAGTAACCA TTCATCAGGG GCAGATAGAA GAGGTTTAGT TTATGAAAGA TGTTAGTCTA 720 

TTTTTATTGA AAAAAGTTTT CAAAAGTCGT TTAAACTGGA TTATCTTACT TTTATTTGCA 780 

TCTGTACTCG GTGTTACCTT TTATTTAAAT AGTCAGACTG CAAACTCAGT CAGCTTGGAG 840 

AGAGAGTTGG AAACTCGTCT TGTAGACCGT GAGAGAGTCA TCAATGAAAA TGAAGAGAAA 900 

CTCTCCCAAA TGTCTGATAC CAGCTCGGAG GAATACCAGT TTGCTAAAAA TAATTTAGAC 960 

GTGCAAAAAA ATCTTTTGAC GCGAAAGACA GAAATTCTGA CTTTATTAAA AGAAGGGCGC 1020 

TGGAAAGAGG CCTACTATCT GCAGTGGCAA GATGAAGAGA AGAATTATGA ATTTGTATCA 1080 
AATGACCCGA CTGCTAGCTC TGGCTTAAAA ATGGGGGTTG ACCG 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1089 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 



1124 



CAGAGCAACT TCGTCAAATC TTCATAGGTG AGGTAACCAA TTGGAAAGAG GTTGGTGGTA 
AGGACTNACC CATCTCTGTT ATCAATCGGG CAGCCGGCTC TGGCTCTCGT GCTACCTTTG 
ATACTGTCAT TATGGAAGGT CAGTCTGCCA TGCAAAGTCA GGAGCAGGAT TCAAATGGAG 
CGGTAAAATC AATCGTATCA AACAAGTCCA GGAGCTATCT CTTATTTATC TCTTACCTAT 
ATAGATGATT CGGTCAAAAG CATGAAGTTG AATGGCTATG ACTTAAGTCC AGAAAATATA 
AGTAGCAATA ATTGACCCTT GTGGTCTTAT GAGCATATGT ATACATTGGG GCAGCCCAAT 
GAGTTGGCTG CAGAATTTCT CAATTTTGTT CTCTCGGATG AGACCCAAGA AGGGATTGTC 
AAAGGATTGA AGTATATTCC GATTAAGGAA ATGAAGGTTG AAAAAGATGC TGCCGGAACT 
GTGACAGTGT TGGAAGGGAG ACAATAATGA ATCAAGAAGA ATTAGCTAAG AAAATGTTGC 
TTCCATCAAA GAATTCTCGT CTGGAGAAAT TAGGAAAAGG TTTGACCTTT GCCTGTCTTT 
CTTTGATAGT CATCCTTGTG GCCATGATTT TGGTTTTCGT AGCGCAAAAA GGCTTGTCGA 
CCTTCTTTGT CAATGGTGTG AATATCTTTG ACTTTCTTTT GGGAGGAACT TGGAATCCTT 
CTAGTAAAGA ATTTGGTGCC CTTCCTATGA TTTTGGGTTC CTTTATCGTT ACCATTCTCT 
CAGCCCTTAT CGCAACACCC TTTGCTATTG GTGCAGCAGT TTTTATGACC GAAGTATCAC 
CAAAAGGGGC GAAGATTTTG CAACCAGCTA TTGAACTCCT GGTTGGGATT CCTTCAGTAG 
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TGTACGGATT TATTGGCTTG CAAGTCGTCG TTCCCTTTOT TCGCAGTGTC TTTGGTGGGA 960 

CTGGTTTTGG GATTTTGTCA GGGATTTCCG TCCTCTTTGT CATGATTTTG CCGACCGTAA 1020 

CCTTTATGAC AACGGATAGC TTGCGTGCGG TTCCTCCNTT ATTATCGTGA AGCCAGTTTC 1080 

GCTATGGGA 1089 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1502 base pairs 

(B) TYPE: nucleic acid 
(Ci STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

CTNTNTTTAA ACCTTGACCT TANTTGANNA TTATACCGAA ATTNTCTTCA TTTTTAAAAG 60 

AAAAAAGGGC GCTGGTAAAA GGGATAATCT TCACCAACTC CCTATNTTTC TACTTATCTA 120 

AGCCTAATTC TGCCAAGATT TGACGTTTGT NGGCAATCTT TNGGACTTCN TNGTCCTCAT 180 

CTTCAGACCA GTCTAGTTTA ATTTCTGAAA CAATCTGCCC AGGGNGATTT NTCAAGATAT 240 

AGATGCGGTC GCTGAGANTG AGGGCCTCCT CAATACTATG CGTGANGATC AGGGTTGTTA 300 

GCTGCAACTG CTTGTGAATC TCAAGATACC AAGCGTGGAG TTCCATCTTT GTCATCTCAT 360 

CCAAGGCGCT AAAGGCCTCA TCTAAGAGAA AGAGCTTGTG CCCAAAAAGG TAGGTCCGGA 420 

GTAAGGCTAC ACGCTGGCGC ATCCCACCGC TAAGTTCATG AGGATACTTG TCTCTTACAG 480 

CTGTCAGCTG GAAGGTCGCA AGAATTTTAT CCGCTCGGGA AATAGCTTCT GCCTTATCCA 540 

CCTTTTGAAT CAAGAGGGGC AGAATGATAT TTCCAAGCAC CGTCTTGTGC TCCAAGAGCA 600 

GATCCTTTTG CAACATATAA CTCACGTGCC CCTTGGGATT TTCTTCACCA TCAAGGACAA 660 

TTCTCCCTGA CTGAACTTCT AAAATCCCAG CGATTAGATT AAAGAGGGTG GTCTTTCCAA 720 

CACCACTTGG GCCTAGGATG GAAACCACTT CGCCTGAAGT CACCTGTAGG TTGATATCCT 780 

CTAAAATCCT CTCCTGACCA TAGGCATAAC TGACGTGCTC TAGTCTAATT TCTGTCATTA 840 

TTTCACAAAT TCGTTGGTGA AGCCTTTGTC TGTCAAGTCT TCTTTAAGGA TACCATTTTC 900 

TTTATCCCAT TTATAGAAAG CATTCCAGCG AGCTGCGTCA AATTGACCCC ATTTTTCCTT 960 

GTCGCTTGCG TATTCTTTTG ACAAGTATTT TTGAGATTCG ATGACAAAGT CACGTTTTTC 1020 

CTTGAGTTCA GGTGCATTCT TGATGAGAAT ATCTGCAGCT TCTNCTGGAT GTTCCATGGC 1080 

ATATTGGTAG CCTTTTNTGA TGGCTTGGAT GACTTTGCGA GCTTCTTCTT TGTTATCTTT 1140 

CAGATAGTCG TTGTTTGCGA TGATAACTGG TGAATATTAN TCAAACTCCT TGACATAGTC 1200 

TTTCAANTAC ATGAAGTTAG CATCTACACC TTGAGATTTA GCAAGGATAC CATCCCAACC 12 60 

GTAGTAAATC CNAGCAGTAT CAAAGACGCC ATTGGCAATC GGTGTGATTG AGTTTGAGTC 1320 

NTTANTTGGT ACTNTTTCAA CCTTCTCACA GTCTCCNCCT TGAGATTCTA CCAAGGTTTT 1380 

CAACATACNN GTTCANTTGG GTCATTCCAT GTCCCATATT TCTTACCAAC CAAGTCTTTT 1440 

GGACTGCCTA CATTNTCACA TTACCGAGAG ATGATCCCTG ATGTATTGTG TTCCACCNAT 1500 

AG 15C2 
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(2) INFORMATION FOR SEQ ID NO: 66: 



PCT/US97/05306 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 66: 



ACTCTTCAGC 


GATTTCTTGG 


CTGAGAATGT 


CCTCATAGGC 


AGGGAGTTGG 


ATGTCCTGAC 


60 


CATATTTCTT 


CTTGAGGGCA 


GTAGTAACTA 


GGCGATAAGC 


CAGATTGGCA 


TTACGAGAGA 


120 


AGATAGGCCC 


GNGGAANTAC 


GAACCAAAGA 


CATTCTTATA 


ATGAACCCCT 


TCACCGACCT 


180 


TTTCTTCGTT 


GTTTCCATTT 


CCATAGACAA 


CCTGTCCCAG 


CGGTTTTTGG 


TCATCAGAGA 


240 


GGAAGGTACG 


ACCTTGGTGA 


TTTTCAAATC 


CATAGTAGGT 


TTCATCGAAA 


TCTTTCATTG 


300 


TGAATCTTGA 


TGTCACCGAT 


AAAACGGTTA 


TTGGTCTGGT 


TGAGCGTGTA 


GTGTCCCATG 


360 


ACCCCTAGCC 


CTTCGATACG 


TTTTCCTGAA 


GCTTCAACAT 


AATATTGACC 


CAATAGTTGG 


420 


AAACCACCGC 


AGATAGCCAG 


AACTACACCG 


TCGTTTTGGA 


TGTAGTTGTC 


AATGCTCTCT 


480 


TTTTTAGCAG 


GTAGGTCGTC 


TGCAATGATA 


CTTTGTTCAA 


AGTCTTGACC 


ACCACCGAAA 


540 


AAGGCGATGT 


CGTAGTGATT 


TTCATCAAAG 


TCATCATGGA 


GAGAAACGAT 


GTCAACGGTC 


600 


ACATGGGTTC 


CCAGTTTTTC 


AGCCACATAC 


TTGAGCATGA 


GGATGTTTCC 


ATTGTCCCCG 


660 


TAGGTATTCA 


TGAGATTTCC 


GTAGAGGTGG 


GCAATGTTGA 


GCTGATAGGG 


GTAATTGCCA 


720 


TCTTTTGAGG AAAGTGAAGT 


ATAAACCATT 


AGTTCATCTC 


CTTTCTAACA 


ATCTGACGAC 


780 


TAGCCAGCAG 


TTCACGAAAT 


TCCAGCATGG 


CAGTATAAGT 


TGCCAGAATA 


TAGGCATGCT 


840 


TGCAGTCTTG 


ATTCTCAATG 


GTCTTGAGAA 


CTTGCTCCAG 


ATTACTCGTT 


TCAGTGATTT 


900 


TCTCAGCTGG 


ATAGCCAGTC 


ACTCGGAGGC 


GACGAGCGAT 


TTCAGAATGA 


CGAACACCGC 


960 


CAGCGTTGAT 


TTCAGGAATG 


TCCATGTCAG 


TGATTTGCTC 


AAAGTCTGCA 


TCCCAGATCC 


1020 


AG 












1022 



(2) INFORMATION FOR SEQ ID NO: 67: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 



CCTCCAGGTC TCGGGAAACG ACCATGGCCT TTCTTATTGC CAACGAACTG GGAGTCAATC 

98 
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TTAAGCAGAC GTCGGGTCCA GTCATTGAAA AAGCCGGAGA TCTGGTAGCT ATTTTGAATG 120 

AGTTAGATCC TGGGGATGTA CTTTTTATTG ATGAGATCCA TCGTTTGCCA ATGTCAGTGG 180 

AAGAGGTGCT TTATAGTGCT ATGGAGGACT TCTACATAGA TATTATGATT GGGGCTGGTG 240 

AGGGTAGTCG TAGTGTTCAT TTGGAGTTAC CACCTTTTAC CTTGATTGGT GCGACGACTC 300 

GGGCTGGTAT GCTCTCCAAT CCGCTACGGG CACGTTTTGG GATTACAGGC CATATGGAGT 360 

ATTATGCCCA TGCTGACTTG ACAGAAATTG TCGAGCGGAC GGCAGATATT TTTGAGATGG 420 

AAATCACTCA TGAGGCAGCA TCTGAGTTGG CCCTACGTAG TCGTGGGACC CCTCGTATTG 480 

CCAATCGTCT CCTCAAGCGC GTGCGCGATT TTGCCCAGAT AATGGGGAAT GGGGTAATTG 540 

ATGATATTAT TACCGATAAG GCTTTGACTA TGCTGGATGT TGACCATGAA GGTTTGGACT 600 

ATGTGGATCA AAAAATCCTT CGTACCATGA TTGAGATGTA CAGTGGAGGA CCTGTTGGTC 660 

TAGGAACTCT TTCTGTTGAA TATCGCCGAA GAACGTGAGA CAGTTGAAGA TATGTATGAG 720 

CCTTACTTTG ATTCAAAAAG GTTTTATCCA TGCGGACACC GTCCTGGACG GGTGGCGACT 780 

GCCTAAGGCA TATGAGCACT TAGGTTATGA ATACAGTGAA AAATAAGCAA GAAATCCTTA 840 

GAGGCTTTTT AGAGAAAATC CAGATATGAT GGCCATTCTG ACGATCATCC GAGACCTTGG 900 

TCTGAAAGAC TCGTGGTTGG CAGCAGGTTC TGTCAGAAAT TTCATTTGGA ATCTTTTGTC 960 

AGACAAATCC CCTTTTGATC ATGAAACAGA TATAGATGTT GATTTTCTTT GATCCCAGAT 1020 

TTTTCTTATG AGGAAACCTT ATTACTGGAG AAAAAGCTGT GAGAGGATTT TCCTCAGTAC 1080 

CAGTGGGAAT TGAAAAATCA GGTCTATATG CATCAGCACA GTCCTCACAC TGCTTCCTAT 1140 

ACCAGTTCTC GTGATGCTAT GAGTAAGTAT CCAGAACGGT GTACGACAGT TGGACTGCGC 1200 

TTGAATGAAG AATCAGATTT TGAACTCTAT GCACCTTATG GTTTGGAGGA TATTTTGAAT 1260 

TTCAAGTTCG TCCAACTCCT CATTTCTTAG AAAATGAAGA CCGAATGGAA CTCTATCAAA 1320 

CACGTTTATC CAAGAAAAAT TGGCAGGAGA AAT X353 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS : 
fA) LENGTH: 560 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 



CTCCTGGAAA TTCTGGCAAG TAGGCTGGAT GAATGTTGAC AATCCGACCT TCATAAGCCG 60 

ACAATAAGGT TGGTCCAACG ATTTTCATGT AGCCTGCTAG GCAAACCAAG TCAATCTGGT 120 

GTTCTTCCAA GAGTTCGACA AGGGCTGCTT CGTAGTCTGT CTTGCTCTCA AACTCCTTGA 180 

GTTCAAAAGC ATAGGACAGA ACGCCGAGCT GCTTTGCACG CTCAAGCACA TAGGCATCAC 240 

GATGGTCTGA AAAGACAAAC TCCACTGGAA ATTCTTCGGC AATCACCTGA AAATTTGAGC 300 

CATTACCAGA GGCAAAAACC GCTATTriTT TCATTTGATA ATGACACTTT CGTTTTCTTT 3 60 

CTTGACGATG CGACCAATTT CATAGACTGC TTCATCCAAC AATTCTTTTA CACGCTCTAC 420 

ATTTTCAGGG CTGACCGCCA ACATAAGTCC CACACCCATA TTGAAGATTT CAAACATTTC 480 

TTCGTGTTTA ATCTGACCAT ATTTTTCAAG GGCTTTGAAA ATTGGAAGCA CTGGAACTTT 540 
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(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 860 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



CTTCAACACT 


TTATTTTTAT 


CTTTCTCCCA 


TTTCGTATGC 


TAGTTCGGAA 


ATTGTTGCTA 


60 


GAACCTCTCC 


GACTATTTGG 


GATGTTCTCA 


TTGCTTTTGT 


AGGAGGGATA 


GCAGGTATTA 


120 


TTGGTGCTAG 


GAAAAAAGAG 


ACCAATAATA 


TTGTTCCTGG 


TGTTGCTATT 


GCAACCGCCT 


180 


TGATGCCTCC 


TCTTTGTACA GTAGGTTATG CTATTGCTTC 


TGCTAATCTA 


AAATTTATCA 


240 


TTAGGCTCCT 


CTTACCTATT 


CCTCATCAAT 


TGTAGCTTTA 


TTGTCATTGC 


GACTTATATA 


JOO 


GGTGTTAGGT TGATGATGGT 


TAAGAAACAT 


TATTTTAAAG 


ATAATGAAGA 


AGACTCTAAA 


360 


ATGCGTAGGA 


TTTTGCTTCT 


AGTTGCTGTT 


TTGCTGATGA 


TTCCGAGTTT 


CATCTCTGCA 


420 


ACGACTTTAG 


TGAGAGAAAC 


GTTGAAAAAA 


GAGTCCCTTA 


AGAAATTTAT 


ATCAGAGCAG 


480 


TTTCAGGGGC 


ATAATATTTT 


GAAAAAAACC 


TATTCTAAAA 


AGACTCATAC 


CCTAAAGCTA 


540 


ACCATTTCAG 


GAAATTATTT 


GACAGAAGAA 


GAACTCGATA 


TGATTTCAAG 


TAAGAGAGGT 


600 


GACTATGGTT 


TAAGTGATGT 


TTCTGTTCAA 


GTTTCACAAT 


TGTCTGATTC 


AGAACAACTT 


660 


AGTAAGGAAG AACTGGTGGA 


GTATTTCTTC 


CAGTATATCA 


AGGATAAGGA 


AGCAAAAGAA 


720 


AAGGAAAAAG 


CTAATAAGTT 


TTATACAGAG 


TCTGAGGAGC 


AATAATTTCT 


TGAGAATAGC 


780 


TGGTTTTTCT 


CGTGAGTCTT 


CTATGTATAT 


CAAAGGAAGA 


CTGAGGTTTT 


AAGTATGAAA 


840 


CTTTTCTTCT 


ATTATAGTAG 
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(2) INFORMATION FOR SEQ ID NO: 70: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 925 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 70: 

TCTTGGCCAA CTGCATGGAG TTCAGCGGTC AATTTCAACG CACCTGAGAA ACAGACCCCT 
GCACCCCTGA AATCTCAGGA GACATGATGO TCTGGATGGA ATCAATAATG AGAAAGTCTG 
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GCTGGATACG CTACCACTTC TGCACGAACA CTCTGCATAT TGGTCTCTGC ATAGAGATAA 
AACTCACTAT CAAAATCACC TAAGCGCTCT GCACGTAGTT TAATCTGCTG GGCAGACTCC 

TCCCCACTGA CATAGAGAAC TGTCCCCACT TGGGACAACT GGGTTGAGAC TTGTAGGAGA 300 

AGAGTTGATT TCCCAATCCC AGGATCCCCA CCGATGAGGA CGAGACTTTC CTGGTACAAC 360 

TCCGCCTCCA AGCACACGGT TGAATTCCTC CATCTCCGTC TTGGTTCGAT TGACATTGAT 420 

GGAAGTCACC TCAGCTAGTT TCATGGGCTT GGTTTTCTCA CCTGTCAAGG ACACACGCGC 480 

ATTCTTGACC TCGGCAACCT CAACCTCTTC CACAAAAGAA GACCAAGACC CACAGTTGGG 540 

GCAACGTCCC AGATATTTAG GGGAATTATA CCCACAATTT TGACATACAA ATGTCGCTTT 600 

TTTCTTTGCG ATGACAAACC TCTTTCTATA TCTCTAACTC ACACTCAATC ACTTGGCAAA 660 

AATCAATCTT CTCATTTGGC ACAAACTGGC GCATGAGCAT TCGATGAGCA ACAACTACCA 720 

CAGTCTGATG TTCTCGATAC TTAAACATAC ATTCTAGAAA CCGAGACTTC ATTTCCGTAG 780 

CTGTCTCATA TTGAATAGGA CTATTAGGAA GCAACTCCCC CTTGTTTTCT AAAAACAGTC 840 

TTCTAGCTGT TTCAAAGTTT TCTATTCCTG TTTTATAGAC CTGCCATTCA TGTAATAAAG 900 
GCTCTACTCT TAAAGGAAGA CCCGT 



925 



(2) INFORMATION FOR SEQ ID NO: 71: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE : Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


71: 






TTCTNGATCA AGTCAAGGAC 


TGCANGCCAC 


GACCGACATA 


GCGTTTGGCA 


ATGGAAACAA 


60 


CCAAACGAAG 


ATTGGCTTCC 


GCAAGACGTT 


GTTTGGCTTC 


GATATCGCCA 


GCTTCAACAG 


120 


CCAGTGCCAA 


CTCTTTCTCC 


TCTTCATTGG 


TCAAGAGAGG AACGACCCCT 


ATTTCTTTCA 


180 


AGTACATACG 


GACAGGGTCA 


TTGACCTTAG 


CAGAAGTTGA 


CCCAATCAAG 


TCCTCATCGC 


240 


TGAGTTCTGG 


TTCTTCTTCA 


TTGCTAAGAA 


CACGCGCACT 


TGGATTTCCT 


TCGTTATCTG 


300 


TGATAGAAAT 


GCCTGCATCC 


TGAATCCGTT 


GCAAGAGATC 


TTCAATCCCA 


TCAGCGTCCA 


360 


AGGTAAAAGG AATAACCAGA CTTGCATTGA 


TTTCATCATC 


TGTTGCTGTC 


CCTTTTTGCT 


420 


TATGATTACG 


GATAAATTCT 


GCTACCTGTA 


CGTCAAATGT 


TGTTACTTCT 


TTTTGTTTTG 


480 


TTGCCATTAT 


TACTCCATTC 


TTCTCTTTTG 


GGAAATTAAA 


CGTTCCAATT 


CTTCTAGGGC 


540 


TGTATCTGTA 


TCTCCTACAT 


GGCTAGCTTC 


CTGCACCTTC 


TTTTTGATTC 


CCATATTGTC 


600 


CTGATTCAAG 


AGAGCCTTGT 


TTCGAAGTCA 








630 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 849 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



PCTYUS97/05306 



(ii) MOLECULE TYPE: Genomic DNA 
(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 72: 



GCCAAACAAC GCATCCATAA TGGTTAAATC TTGATAAGGC AGAGCTACAT TCAAAACAGC 
TTGCTGGTTT GTAGGCTTTC AATCAGAGCA ATCACTTCTT CAACCTTGTC AGCATCAAGA 
GCAGCTGTCT CAATCTTTGT ACTTGTTTTG CCTTCCAGTT TAGCTTTCAA GTCATCGCAT 
TTTGACTTGG TACGGCTAGC AATCATAATC TCTGTAAATG TTTCGCTATC TTGACAAATC 
TTTGAAATAG CAACTTGGGC AACGCCCCCA CAACCAATAA CTAGTAAACG ACTCATTTTT 
TTCCTTCCTC TTCTTCTAAA ATGTCCTCAA CA.TACTTGGG CAACATAAAG GCTCCCACGT 
GTAAGTTTCC AGTGTAGTAT TCTGTGAAAA GCTGGCGTTT TTTCCAGCCT TCCTTGTCAA 
AATCTTTGAC AGGGTGGTAT TTTTTCGATG CAAATCCAAA CAACCAATAG CCAGCTGGGC 
TAGTTGGAAT ATGGGCCTGA TAAACCCGAC TGATTGGAAA GGCTTGATTG ACCTTGCGGT 
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CTAGGGCCAT CTTCTCACCT CCTCCTTGTC AGTACATTCT TGCAATAGAA AAAATAAGAT 
TGAGTCCCCC CAACCTTAAA TTTTTTCACC ATCTTCTTTT TCTTTAGCAA TTTGCTCTTT 120 
GATTTTCTTT TCTTCTTCTT CTTTGCGGCG TTTTTCTTCT TCGATACGGC GACGCACTGC 180 
TTCACGTTTT CCTTCTGGAT CTGGGTGAAT TGTAACGTTT CCTGATTCGA TTTCTTCTAA 240 
AGCGCGAAGA GTTGATTTTT CAGACTTGAA ACCTTGAGTT GCTGGGGCAC CTGCTTCCAA 300 
TTCGTGGGCA CGTTTTGCTT CCAAGATTAC GAGTGAATAT TTTGAAGGAA CCTTGTCGAG 360 
CAAGGTATCA ATAGAGGGTT TTAACATCAT TTGCTTGTAC CTATTTTCTA AATTTTATCG 420 
GGTAGTTGGA GATTTTGGTA ACATCTCCTG ATAGTGACCA ATGACACGAT CCACACAGAA 480 
GTGTTCTGCT TCAATCACAC ATTTGACACG TTCAGCAGCT AGGGGTACCT GATCGTTGAC 540 
AATCGCATAA TCATACTCAC GCATGAGGGC AATTTCTTCC TTGGCCTTTT CGATTCGTTG 600 
GGCAATCACT TCTGCACTAT CTGTTCCACG ACCTACCAAG CGTTCTTGCA ATTCATCCAA 
ATCTGGTGGT GTCAGGAAGA TAAAGACAGC ATCTGGAACC TTTTTCTTGA CCTGAAGAGC 
ACCCTGAACT TCAATTTCAA GGAAAACATC GATTCCCTTG TCCAAGGTTT CATTGACATA 
GGTCAGAGGA GTTCCATAGT AGTTGCCGAC ATATTCTGCG TATTCCAACA TCTGTCCTTG 
ACGAATCAG 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 



660 
7-20 
780 
840 
849 
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GCATGCTTCG GCAGGCCGAC TCATCCTCGT CAAAGAAAGG A 



PCT/US97/0S3O6 

581 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 



TTCTGAAAAT 


CCATACTTCT 


ACCAAGGAGA 


TACGCTAGCG 


GTCTCGGAAG 


TTCTCATACT 


60 


TCTATCGCTA 


TATCTGGCCA 


TCGCCTTTCT 


ATCCAAGGAT 


TGACAACAAG 


AGATAAGGCA 


, 120 


GAGAAAAAAT 


TCTTGCTGGA 


TCAGCTGGTT 


GCCTGCGATG 


GTGGTACAGG 


TGTCATGCAC 


180 


GAAAGCTTCC 


ATGTAGATGA 


TCCGACCCTC 


TACTCTCGTG 


AATGGTTCTC 


CTGGGCTAAC 


240 


ATGATGTTCT 


GTGAGTTGGT 


CTTGGATTAC 


TTGGATATTC 


GCTAAGGGGC 


TCGCTTTAGC 


300 


TCAACCGATT 


CTTATCAGAA 


TCACAAGTTT 


ACATTTAAAA 


CGTTAAAATT 


TAAATTTAGA 


360 


ATGAGGTTTT 


ACTTCATGGA 


AAATGTTGTT 


GTACATATTA 


TCTCACATAG 


CCACTGGGAT 


420 


CGTGAGTGGT 


ACTTGCCTTT 


TGAAAGCCAT 


CGTATGCAAT 


TGGTGGAATT 


GTTTGACAAT 


480 


CTCTTTGATC 


TCTTTGAAAA 


TGACCCTGAG 


TTCAAGAGTT 


TCCACTTGGA 


TGGACAAACT 


540 


ATTGTCCTTG 


ACGACTAACT 


TACAAATTCG 


CCCTGAAAAT 


CGCGACAAGG 


TCCAACGCTA 


600 


CATTGACGAG 


GGCAAGCTTA 


AAATTGGTCC 


CTTTTACATC 


TTGCAGGATG 


ATTATTTGAT 


660 


CTCCAGTGAA 


GCCAATGTCC 


GCAATACCTT 


GATTGGTCAA 


CAAGAAGCTG 


CCAAATGGGG 


720 


TAAATCAACC 


CAGATTGGCT 


ACTTTCCAGA 


TACCTTTGGA 


AATATGGGAC 


AAGCGCCTCA 


780 


AATTCTTCAA 


AAATCAGGCA 


TTCACGTGGC 


GGCCTTTGGT 


CGTGGTGTGA 


AGCCGATTGG 


840 


ATTTGACAAC 


CAAGTCCTTG 


AAGATGAGCA 


GTTTACGTCT 


CAGTTTTCAG 


AAATGTACTG 


900 


GCAGGGTGTG 


GATGGTAGTC 


GTGTTTTAGG 


TATTCTCTTT 


GCCAACTGGT 


ACAGTAACGG 


960 


GAATGAAATT 


CCAGTTGACA 


AAGATGAGGC 


CTTGACCTTC 


TGGAAACAAA 


AATTGTCAGA 


1020 


TGTGCGTTGC 


CTACGCTTCG 


ACCAACCAAT 


GGTTGATGAT 


GAAACGGCTG 


TGAACACCAG 


1080 


CCTGTCCCAA 


AAAAAATCTG 


AGCGAAGCCA 


TTCCGTGTGG 


CAAATGAACT 


CTTCCCGGAT 


1140 


GTAATCTTTG 


TTCATAGTTC 


TTTTGATGAA 


TATGTTCAAG 






1180 



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 



CTCAAAAAGT 


CTGAAATTGG 


ATCATTTGGA 


ACCATCTGAG 


CAGTAAGTTT 


AACACCTTTT 


60 


TCTGCCAGTT 


TGTGGAAGTC 


TTCCACATCC 


TTGTCTACTA 


CGTTGATAGA 


ACGTGTAATA 


120 


GAACGAGTTT 


CTGGTGTTTG 


AGACATATTC 


CCAACATTAA 


GGGTTTCAAG 


TGGTACACCT 


180 


GCTTCTACCA AACCAAGGGA 


AGCGGTCTGG 


TTTACGAGCC 


ACGATAAAGA 


GACGTTGGCT 


240 


ATCGTATTTG 


CCAGCAAGAA 


TATTGGCTGC 


AGCTTTCTCA 


ACTGGCAAAA 


TACTCAATTT 


300 


CACACCTGGT 


GGTGTCGCAA 


GTTTCAAACC 


ACTCTTTTCA 


ATATCGTTGT 


TGACAACTTC 


360 


GTCGTCTACA ACCATAATGC GTGAAACATT 


TAGTTTTCCA 


GCCCAAAGAT 


TGGCTACTTG 


420 


TCCGTGGATC 


AAACGTCCAT 


CAATACGGCA 


TCCTACAATT 


GTCATAAGTT 


TTCCCCCTTT 


480 


ATATGTTTTA 


GTGTAGGTTT 


ACGAGTTAAA 


TGAATCTCTT 


CTTTATATTG 


ACCTTCTGTT 


540 


TCAAAGATAA 


TGATGCGGTT 


GGCACCTTCC 


TTGAGATAG 






579 



(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 736 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 



60 
120 
180 
240 
300 
360 
420 



CGCATANCCT AAAAGTTTTC TTTGACATCT TCTGGAATAT CTGTATCATT AGCTTCCTGA 
ACATCTCTAC CATAAATCCC AAAGGCTGGA ATCCCTTTTT GAGCATGTGA AGCTAGTACA 
GCTGCAAGAT AGACAGCTCC TGGGCGTTCT GTCCCATTAA ATCCCCAAAT AGCATGAGGA 
ATATCTGGAG ACATATCCAT AGTTTCACTA CCATAACACC AGCATGGTGT AACTGTAATT 
GTTGCGCAAA CATTTGATTT TTTAAACAAC TCATGGGAAG CTGCAGCCTC TGGAACACGA 
CCAATGGTAG ATGGAGAAAT CACACATTCC ACAGGTTCCC CATCTGGATA TTTCAATGTG 
CTTGAAATCA AATCTGCCAC ACTTTTAGCC ATGTTCATTG TTTGTACTTC AAGTGATTCG 
CGTACACCTT GACGACGACC ATCAATAGTC GGACGAATCC CAATACGTGG ATGTTGAATC 4 80 
ATACTATTTT CCTCTTATTT TTCTGTTTCT TTCATACTAC CATCGCCATT ATATTTACGG 
TAGCCTGGGT GACGACCTGT AACCTTATAA TTTTCTCGCA TTGAGAATAG ACGTTCTAAA 
GTCGGACGAG CAATTTCTTG CTCCTCAATG CCCTTTGTAG AAAGTAACAT TCTTCCGTGG 
AAGGTTGTCT TTGCGACTAA TTCTAAAGTT TCCATACGGT AGTATGCTGT AATTACATCG 
CTTCCGACAG TCAGAG 

(2 ) INFORMATION FOR SEQ ID NO: 77: 



540 
600 
660 
720 
736 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 590 base pairs 
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(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



CTCATTGTAT 


TTAGGAGAAA 


TGATGGTATC 


TTCCAAATCA 


AAATCAACTT CACTCCATAG 


60 


TCTCAACGGA 


TTGATTTTCC 


CATCTTGATA 


GGTCACATCC 


TTGTCAAGGA TAAACTGAGT 


120 


CAACGCCTCA 


TGCTGACCTT 


GACACCTGAT 


GTCATCTACC 


AAGAGCCAGA 


CATCCTCTAC 


180 


CAACATGAGG 


ATTTTTCTCT 


TGTGAAGATA 


AGGCAAATCA 


GGTTCTGCTG 


ACCAATAAGC 


240 


CCCTTCAATA 


TAATGCATTC 


CCTCCCTTTC 


TTTATGGTGA 


CAAAACAGGG 


AGTGAGGATA 


300 


GTATTCATAT 


TCCCAGGATC 


CCGTGATTCT 


TTCCGGAGCT 


TTCCCATCTA 


CAATGCAGGT 


360 


CGAATGACTC 


CAAGCACTCT 


TTAAGAGATA 


ACGTTCATAT 


ATCTCCCGAT 


AAGAATAACG 


420 


CCCAGCATCT 


ATGAAAATAG 


GTTGGCCTTG 


ATACTGTAAG 


CAAAAACTAT 


TCTCGTCACT 


. 480 


ATGGCTATGG 


GCACTTCCTA 


GCGGACCATT 


TTTGAAAAAT 


AGATAACGAT 


GTTCATCCTT 


540 


AATGCAGACA 


TGTCCAGAGT 


CTTCAAAGAT 


CATGGACTTA 


GGCTGCCAAG 




590 



(2) INFORMATION FOR SEQ ID NO: 78: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1199 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 



CTAGTGTAGC 


TTCCTTAATC 


TTATCTGATA 


AGATAGCTGT 


CATATCAGAC 


TCAATCATTT 


60 


CCTGGAGCAA 


TCACATTGAC 


TCGTATATTC 


CGACTAGCGA 


CCTCGCGTGC 


CACAGACTTG 


120 


GTAAAGCCAA 


TCAAGCCAGC 


CTTAGAAGCA 


GCATAGTTAG 


CTTGACCAAT 


ATTCCCCATC 


180 


AAACCAACAA 


CACTAGACAT 


ATTAATGATA 


GCACCTTCTC 


TGGCTTTCAA 


TCATCGGTTT 


240 


CCAAAGACTG 


ATTGTGTCAT 


ATTAAAGGCA 


CCAGTCAGAT 


TGACCTTGAG 


CACTTTTTCA 


300 


AAATCTGCTT 


CTGTCATCTT 


GAGCATAAGA 


GTATCTTGGG 


TAATCCCTGC 


ATTGTTGACC 


360 


AAAACATCTA 


CTGAACCCAG 


TTCTGCAATA 


GCTTGATCAA 


TCATACGCTT 


AGCGTCTGCA 


420 


AAATCTGATA 


CATCTCCTGA 


AATGGGAACC 


ACCTTGATAC 


CATAGTTTGA 


AAACTCAGCG 


4B0 


AGCAATTCTT 


CTGAGATTGC 


CCCACGACTG 


TTTAAGACAA 


TGTTGGCTCC 


TGCTTGAGCA 


540 


AACTTGTGGG 


CGATGGCAAG 


ACCAATTCCA 


CGACTCGAAC 


CTGTAATAAA 


GATATTTTTA 


60C 


TGTTCTAGTT 


TCATTTTTTT 


CCTTTCAAAA 


CTTCTACTTA 


TTTTAGTCTA 


TTTTTCTAAA 


660 


AGTGCi'ACTA 


AACTCGCTTG 


ATCTTCCACA 


TGAGCTAAGT 


GAGCAGTTTG 


ATCAATTTTT 


720 


TTAACAAAAC 


CTGACAAGAC 


TTTCCCCGGT 


CCAATCTCGA 


ATAAAGTTGC 


TTATGCCTGC 


780 
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TTCTTGCATG ACCCCAATAC TTTCATAGAA ACGAACGGGT TCCTTGACCT GACGCGTCAA 840 

GAGCTGAGCA ATGTCCTCTT TTTGCATCAC AGCAGCTTCT GTATTGCCGA CTAGGGGACA 900 

AGTAAAATCT GAAAAACTTA CCTGAGCTAG AGTTTCAGCT AGTTTCTGGC TAGCAGGCTC 960 

AAGGAGAGCG GTGTGAAAGG GACCTGACAC CTTAAGAGGA ATCAAGCGTT TGGCACCTGC 1020 

TTCTTGCAAA AGTTCAACCG CTCGATCAAC TGCAACCACT TCTCCAGCAA TGACGATTTG 1080 

TGCAGGTGTG TTATAGTTGG CTGGAGTAAC CACTCCAAGT TCCAGAAGCT TTTTGACAGG 1140 

CTTCTTCAAT GACCTCTACT GGCGTATTGA GAACTGCTAC CATCTTGCCA AGTTCAGCA 1199 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1414 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 

CTACTATTTT CTTTGCCATA GCCTTCTCCT TTACACACTA CGCATATCGT GGTAAGAAAC 60 
ACTGCGTCCC ATCTCACCTG CATTCTCTTT TTGAACAAAG GTATTAGCGT TTATATAGGC 120 
AATAGCAGAA GCCTTCAACA CATCAAAATC AAGCCCTGCT GCATTAAAGA TGGTTTCTGT 180 
ATCCCTGTTT TCAACAGTGA CCAACAACCC GATCCTGGGC ATCGATTCCA TCTGTTACCG 
CATTGATAGT GTAGGACACC AAACGAACAG ATTGGTTAAA GAACTTATCG ATAGCGTTAA 
AGATTGCTTC AACGGAACCC TGTCCCTGTC GCATTAAATT CGACTTTCTC ACCATCCATA 
TTGGCTAGGC TAACGAGCGC TTCAATGTCA TTATCTGCAT GAGTTTGAAG TTGTAAATCA 
TCAAAGTGGA AGCCTTCTGG ATTTTCAACC ATGGTTCCAG CTACCAAAGC TCGAGTATCT 
GCATCTGTGA TTTCTTACTT CTTATCGGCC AGTGCCTTGA ACTTAGCAAA GAATGGTTTG 
ATATCCTCTT CTGTAAAATC TAAGGCCAAT TCTCTCAGTT TCTCAACAAA AGCATGGCGA 
CCAGATAATT TTCCAAGCGG AATCTTAACA CCAACCAATT CAGGTGTGAT GATCTCATAA 
GTGAGAGGAT TTTTAAGGAC TCCATCTTGG TGAATACCAG ATTCGTGGGA GAAGGTATTG 
CCACCAACGA CGGCTTTGTT TTTAGGAACT GGAATACCAG AGAAGCGAGA AACCATTTCT 
GACGTATTGA TGGTCTCATT TAGGACAATA CTGGTTTCTA CTTGGTAGTA ATCTTGGCGA 
ATATTGAGAG CCACTGCAAT TTCTTCCAAA GCAGCATTTT CAGCTCGCTC CCTAATACCA 
TGGATAGTCT CTTCAAAAAG TCCTGCACCA TTCTTGACAG CAGCAAGGCT ATTTGCCACT 
GCCATCCGAA GTTCATCATG ACAGGGAGGC GAATAGATGA TCTGACGATC CGTCTGGACA 
TTCTCAATCA GGTATTGGAA GATGGCACCA CATTCCTCTG GTGTGGTAAA TCCTATATTT 
TCTGAAAATT TCTTCAGTAA AGAATATTTA GCTAATTGAA AGTTCATGAA AATTATTAAA 
ATATTTCATT TTTTAGAGGT TAAGTTCCAA CTTTTTTCTA TCAATTCCAG TACTTNTTCA 
TCTGATAAAG TATCATCAAG GGACACACTA ATCCAGTAGC GCTTGCTCAT ATGGAAGGCT 
GGATAAATCC CCTTTTGTGA AAGCAAATTA GCTACTTGGT CATGCTTGAG GTTGACTGCT 
TCCACTTGTC CTTCTCTGCC CTTTTCCAGC TTATTCCAAG AGATTTTCAN CAAGACGGCA 
TACCACTTTT GATTGCCTTC ATGGCGCAAT ACAG 
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(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(Di TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 



60 



CTCCCCATTT TGGAAAATTT CTGTCAAGAA ACGGCGAACC AGCTTTTTAT TTTCTGCTTT t>u 

CTTGTCCAAA TCCTTGATTT CAAAATCTCC AAAAATTTGA TCTAGTTGGT CATTTTCAGG 120 
TGTTCGATAG TAGTCAATGA CATCCCAATG CTCAACAATA CGACCATTCT CATCCGCACG - 180 

GAAAGTATCC GTCGTCACCC ATTGAGCTTC TCCACCATTC AGATATTGAT GGAACATGAA 240 

CAAAGACCAG ATTGCCATCC TCAATGGTGC GGACAATCTT AATCTGACGC TCTGGATGAC 300 

GCTCAAAGAA ATCTGCAAAG AAGGCTGCAA ATCCTTCTTT CCCGTCAGGA ACACCTGTCG "360 

AATGTTGGAT ATAGGTATCC CCTACAGACT GGGCTTGAGC CTCAGCAACT CGTCCGTCTT 420 

GAATGGCATG GATGTATAGG TTGTGAGCAT TTTTCACTTG TTGTGACATA TTCTAAACCT 480 

CATTTCCCTT CTCTTTCAGA TTCGCCAAAA TTCTTTCTTG AAAACCTTCA AATTGGTGAA 540 

TTTCTTCCTC TGAAAATCCT TTGTAAAAGA TAGTATCCAA TTTCTGACTG ACACGATGCC 600 

CCACTTCTTT CTGGGACTTG CCTAACTCCG TTAAAACTAA ATACTTCTTA CGCTTGTCTT 660 

TTCCACACGG ACTAACAATT ACAAGCTTTT GTTCCTCTAG CTTTTTTATC ATAGTCGTCA 720 
GCGTATTATT CGCAAGCCCA GTCGCAAGCG 



750 



(2) INFORMATION FOR SEQ ID NO: 81: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 



GTATCGTTCA ATTCCAAAGA TACGGAGGAA GCTGCGTTTC GCGAGAAAAC TCTTGTTAGA 
CCAGATAGGG ATTAGCCAAA ATCCTTCTAT GNGCTTATAT CCAAATGGAG ATGCAAACTT 
CTGCCCAACT CTTGACCAAT AAGATCTTTC TTAAAAATCC ACTCAAGGCT CTGGTAGAGG 
AAAAATATGG GATTGAGTAT GAAGAATTTA CCAATCCTTG GCACGCTGCC ATTTCTAGCT 
TCGTTGCCTT TTTCCTTAGA AGTTTGCCTC CAATGCTGTC AGTGACCATA TTCCCAAGTC 
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AATATCGCAT CCCTGCTACT GTCCTTATTG 
CTAGTGCCAG ACTTGGAAAG GATCCGACTA 
GTCTCTTGAC CATGGGAGTT ACCTTCCTGC 
GAAATACCTC GATTTTGAAG TCGAGGTATC 
TTCTTGAAGT AATCATGAAA ATCAGCACAT 
AGGTCACAAT CAACATCATA GTCGTATCCA 
GCATATGGTA GGCAAAGGCG AGATGTATGA 
TTAAAACCTG TTTGTTGATG GTTTGCTTGA 
AGATAATAAA GCGTTCACGG TCTTCGTAGC 
GAACAGTTCC GACCATAAAG ATAATGGATA 
AGAGGACACT CATTTGAGCA CTAGCATCTG 



PCT/US97/05306 

TCGGTGTGGC CCTTCTTCTC ACTGGTTACA 360 

GAACAGCTAT GATTCGGAAC CTTGCTATTG 420 

TCGAACAACT TTTCAGCATT TAGAATACAA 480 

TTTTTTACAT TTGCACAATC TTGCGATAAC 540 

AGGCGATGAG GAAGATAGCG CAGATAGACA 600 

GTACACCAAT CACTTTTAAA ATCAGGCTAA 660 

AGGCAAAGAG CAAAGGAAGG AAGAAAACAG 720 

TTTGCTTTTG GTCCAAACCG ACTTTCTGCA 780 

CTTCAGAAAT TTGTTTGTAG TAGATGACCA 840 

GGAAAATACC GATAAAGAAG ACACCGCCAA 900 

CTAGATTGCT ACCATAAACA TAG 953 



(2) INFORMATION FOR SEQ ID NO:B2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 665 base pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



CTGATTGAAC 


TTCTATTTAC 


TAATATTCAA 


AAATCCTCCG 


TTTCAAAGAG 


CAGGGAACTC 


60 


TTTGTGACAG 


AGGATTTTTT 


CTATAGGGCT 


TTAGCAGCTG 


CAATTGCGGC 


TTCGAAGTTT 


120 


GGCTCAGAAT 


TGATATTATC 


CACGTATTCA 


ACGTAGCGAA 


TCGTATTGTC 


AGTATCGAGG 


180 


ACAAAGACTG 


CGCGTGCTAA 


TAGGTGCCAT 


TCGTTGATCA 


AGAGGGCATA 


ATCGCGCCCG 


240 


AAAGAATGGT 


CAAAGTAGTC 


TGAAAGCATA 


ATGGCATTGT 


CAAGGCCTTC 


AGCACCGCAC 


300 


CAACGTTTTT 


GAGCAAAAGG 


TAGGTCCATT 


GAAACAGTCA 


ATACGACCGT 


GTTGTCCAGT 


360 


CCAGCCAATT 


CTTCATTAAA ACGACGTGTT 


TGAGTTGAGC 


AGATGCCTGT 


ATCGATAGAA 


420 


GGAACGACAC 


TCAAGACTTT 


TTTCTTGCCA 


TCAAAATCAG 


CCAGAGATTT 


TTTAGAAAGA 


480 


TCTGTTGTAG 


TAAGAGAAAA 


ATCAAGCGCC 


TTGTCGCCGA 


CTTGTAGTTG 


TTTACCTGTA 


540 


AAGCTCACAG 


GATTTCCGAG 


AAAAGTTACC 


ATAGGATACT 


CCAATCTTTT 


TTCTTCCATT 


600 


GTATCTGAAA CAGTCAGAAT 


TTTCCAATGA 


TTTGACCGGA 


AATGTGGGCA 


TAGAAAAAAC 


660 
665 



(2) INFORMATION FOR SEQ ID NO: 83: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1039 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 



CTAAGTATAA 


AAAACTTGCC 


GAAAGTATCT 


CTGAAGTTAA 


TTTGAATGTC 


TTTTTTCAAA 


60 


ATGCTTTAGA 


TTTTGTTGAA 


ATCTTTAATC 


AAGAGCCTTG 


TCTGATAAAT 


TTTTTTGAGT 


120 


TTATGAATTC 


TATGTTACAA 


GGCGATATTC 


CAAAGTTAAC 


AATAAACCAA 


GAATAATAAG 


180 


AGGGGAACAG 


TATGAAAGAT 


ACGATTTCTA 


ATAAGGATTT 


GATTTCCATG 


GGCTATAGAC 


240 


CATCAACAGC 


AAATGCTATT 


ATCCATCAGG 


TGAGAGAATT 


ACTTGTATCA 


CGAGGCTATA 


300 


CATTTTATAA 


TCGCAAACGT 


TTGATGGTTG 


TTCCAAAAAG 


TGTTGTGAAA 


GAGTTGTTGG 


360 


GAATGGAATT 


GTGAAATGGC 


TTATATCGAG 


TATAAACAGC 


GTGGGAAGAA 


AAGGCTTTGG 


420 


TCGTTTTCTA 


TACGTGAGAG 


GAGCAAGAGC 


CTACTCCATA 


AAAGCGGATT 


TAAAACAAAA 


480 


CGAGAAGCTA 


AAATAGAAGC 


GGAGAAAGTT 


CTTCATAAGT 


TAAATACTGG 


GAGTGTCTTA 


540 


AGCTCTAGTA 


TGACTTTATC 


GGAGCTTTAT 


AATGAATGGC 


TGGATTTAAA 


AATTTTACCT 


600 


AGCAATAGAA 


GTGTAGTTAC 


TAAAAAAAAA 


TATCTTATGA 


GAAAGAAGGT 


CATCGAAAGG 


. 660 


TTATTTGGAA 


ATAAGCCTGT 


ATCACAAATT 


AAGCCTAGCG 


AGTATCAAAA 


AATTATGAAT 


720 


GAGTATGGAG 


AGACTGTATC 


GAGAAATTTC 


TTAGGAAGAT 


TGAATTCTAG 


TATCCAGGCA 


780 


AGTATACAGA 


TGGCTATTGC 


TGATAAGGTG 


ATAATAGAGG 


ATTTTACTGC 


TTATGTTGAG 


" 840 


TTGTTCTCTT 


CTAAGAGTGG 


ACACAAAGGT 


TGAGGAAAAG 


TATCTACATA 


CTGAGTCAGA 


900 


TTATCAAAAA 


GTTTTAGTAT 


ATTTGAAAAA 


TAAGTTTGAT 


TATCAGAAAT 


CTATTGTACC 


960 


GTATGTAATT. 


TATTTCCTTT 


TTAAAACTGG 


CATGCGTTTC 


TCCGAATTGA 


TTGCTCTAAC 


1020 


TTGGGATGAA 


GTTGACGAG 










1039 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 793 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 



CTGAGAAGTC 


AGGAAGGCGA 


TGTGGAGAGA 


TTGAGGAATT 


GGATACTTAA 


CTCCATAGTA 


60 


ATCTTCGTAA 


AACTCGATAG 


AGCGAACAGC 


GATATCCAGT 


GAGAAATCAA 


GATTTGAAAG 


120 


TGGATGTGCT 


TTGGTTGAGT 


AGACACCCTA 


CCAGGGTACC 


ATTTTTAGTT 


TTAGCGGTCA 


180 


CCCCTTGCAA 


ATCACCAGCA 


ACAAAGGCCA 


ACAAGTAAGA 


AGACATGCGA 


GGTGTTGTCT 


240 


CAAACTTCCA 


GATACCTGTT 


TCCTTACGGT 


TTTCAACATC 


GATTTCTGGC 


ATGTTTGACA 


300 


AGGCCAATTC 


ACCTTCTGCT 


TGGTCAAAGC 


GAAGAGAGAG 


GTCAAAAGTT 


GCTTTGGCTT 


360 


CAGGCTCATC 


CACACATGGG 


AAAGCTTCGC 


GCGCAAAATG 


GCTCTCGAAC 


TGAGTAGACA 


420 


AGACCTCCTT 


CTTGACTCCA 


TCAACTGTAT 


AATAAGAAGG 


GTAAATCCCT 


GTCATGTTGT 


480 
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CTGTAATTTT ACCAGAAAAG GCAAGAACCA ATTCAACTTG ACCAGCCTCA GCCAATTCGA 540 

TATGAAGGGC TTCATTGTCA TGGTCAACTG TAAATGGACG AGCTTGACCT GCAACTTCTA 600 

CAGAGGTGAT TTCCAAGTCT TTTTGGTGGA GGGAGATGCG GTCACTCTGT GCTTGACCAG 660 

TGATGGTCAC TTTCCCAGAA AAAGTCTTGG TCTCACGACT CAAATCTAAA AATAAATCAT 720 

AATGTTCAGG AACAAATTGC TTAATAAAAT GTTCAACTGC TTGCATAGTT TTCTCCTATT 780 

CTAAGTTTAA GAG 793 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



CGGATGATTT 


GCTTTTCATC 


TATGATAAAG 


TCTTTGTTGA 


GCGTCGGAAT 


CTCTACTGAC 


60 


TGGAAATTTC 


CCGTAGATAA 


TCCAAATGCC 


CTTTAAAGAA 


AACCTCATCT 


GTCAACACCG 


120 


AAATCATCAC 


TGCTCCGTTT 


TCTTCATAAG 


TCTGGGCCTG 


TTGCACAATA 


TCCACATCGA 


180 


GATTGATATC 


TCCCCAAAAC 


TAGGGCTAGC 


TTTCTTGACT 


TCAGCGATTA 


CCTGCAAGCG 


240 


GTCCTGATGA 


TTCTTCAAAA 


ATTCTGCCAA 


GCGATAGGTC 


TGGCGCAGAG 


GCTGGATTTG 


300 


CTCCAGCTTC 


ATCTGCTCCA 


CCTCACGCGC 


CTTCTGCTCT 


AAGATTCGTG 


CTAAAAATTC 


360 


CTGACTCATT 


TTTGGTACTC 


CTGTAACAGT 


CTGAGTTTTT 


CAAGGGCCTT 


GCCTCTAGCA 


420 


ATCACTTGAC 


GGGCCAAGGC 


AACTCCTTCC 


TTGATGCTAT 


CAATCTTACC 


ATTAGCATAG 


480 


AAACCAAGAC 


CAGCATTCAA 


GACTGTCGTT 


TCCAAGAATG 


GACTTGCTTC 


GTTTTTCAGA 


540 


ACGCTAAGCA 


AAATTTCTGC 


ATTTTCCTGA 


GCATTCCCAC 


CACGAATATC 


TTCCATAGCA 


600 


TAGCCTTCCA 


TTCCCAAATC 


CTCTGGAGTA 


AAGCTTGACA 


AG 




642 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DMA 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 86: 



CTCTTGCAGG TTATTAAGGA GAAAACGGAG GTAATAGAAG TATGATTTAT ACAGTCACAC 60 

110 



WO 97/37026 PCT/US97/05306 

TCAATCCATC CATTGACTAT ATCGTTCGTT TGGACCAAGT CAAAGTTGGT AGTGTAAATC 120 

GTATGGACAG TGATGATAAG TTTGCTGGTG GGAAAGGAAT CAATGTCAGC CGTGTCTTGA 180 

AACGTTTGAA TATATCAAAT ACAGCGACGG GATTTATCGG TGGCTTTACT GGTAAATTTA 240 

NCNCAGATAC TTTAGCAGAG GAAGAAATCG AGACNCGTTT TGTCCAGGTG GCAGAAGATA 300 

CTCGTATCAA TGTTAAAATC NAAGCAGACC AAGAAACAGA AATCAACGGA ACGGGTCCAA 360 

CTGTTGAACC GGTTAAGCTA GAAGAATTGA AAGCTATTTT ATCTAGTCTG ACAGCAGAAG 420 

ATACAGTTGT CTTTGCAGGT TCAAGTGCTA AAAATCTAGG CAATGTTATC TATAAGGGAT 480 

TTAATCTCCT TGACGCGCCA GACTGGTGCG CAAGTGGTCT GTGACTTTGA AGGACAGACC 540 

TTAATTGATA GTTTGGATTA CCACCCTCTT CTTGTAAAAC CAAATAATCA TGAACTTGGA 600 

GCGATTTTTG GGGTTAAACT CGAAAGTTTA GATGAAATTG AGAAATACGC TCGTGAGTTA 660 

CTGGCTAAGG GTGCTCAAAA TGTTATTATC TCTATGGCTG GTGATGGTGC CCTTCTTGTC 720 
ACATCTGAGG GAG 



733 



(2) INFORMATION FOR SEQ ID NO: 87: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 



CTAAGGGAAG TAGGAAAAGT ATGTATCCAG ATGATAGTTT GACATTGCAA CACGGACTTG 60 

TACCAGATCA ACATGATGCA GGTTTACTTT GATCAAGGGA TTTACAATAA GAAGGCGGTC 120 

TTTGAGGTGT ATTTCCGCCA ACAGCCTTTT AAGAACGGCT ATGCGGTTTT TGCAGGTTTG 180 

GAAAGAATTG TGAACTATCT TGAAGACTTG CGTTTTTCAG ATAGTGATAT AGCCTATTTG 240 

GAGTCGCTTG GTTATCATGG GGCGTTCTTG GATTACCTTC GCAATTTCAA GTTGGAGTTG 300 

ACCGTTCGTT CTGCCCAAGA AGGGGATTTG GTTTTTGCTA ATGAACCGAT TGTGCAGGTG 3 60 

GAAGGACCTC TAGCCCAATG TCAGTTGGTC GAAACGGCTC TTTTGAACAT CGTCAACTAC 420 

CAGACCTTGG TGGCGACGAA GGCAGCCCCC TATCCGTTTG GTTATCGAAA ATGAACCCTT 480 

GATGGAGTTT GGGACACGTC GGGCTCAAGA AATGGATGCG GCCATCTGGG AACACG 536 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



fii) MOLECULE TYPE : Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 



CTGCCCCTGT AAGGCTGGAC GATTGCCTTT 


CTTAGTATCC 


GCAAAGAGGT 


AAACTGAGAA 


c n 

oU 


TAGAGAGGAT 


TTCTCCTTCA 


ATATCTTTGA 


CAGACAGGTT 


CATCTTGCCT 


TCTACGTCTG 


120 


AAAAAATCCG 


CATATTGACC 


AGTTTTCTCA 


CAGCATAGTC 


CAAATCTTCC 


TCTTGGTCCT 


180 


CTGGTCCAAC 


ACCAACCAGC 


AATAAAAGTC 


CCTGATTGAT 


TTTTCCCTGA 


ATCTGGCCTT 


240 


CTATACTCAC 


TTGGGCTTTT 


TTAACCCGTT 


GGATAATGAT 


TTTCATAATA 


GCCTTTCTAG 


300 


TAAGAGCTAG 


GACAACTAGC 


CGTTGGTCCG 


TTTGACAGAG 


TAAACTTCTG 


GCACACTCTT 


360 


AATTTTATCG 


ACAACCGTGG 


TCAGTGTAGA 


GAGGTTGGCA 


ATACCGAAGG 


ACACATGGAT 


420 


ATTAGCAAAC 


TTCATATCCT 


TGGTTGGTTG 


GGCATTGACC 


GTTGAAATAT 


TCTTGGTTGT 


480 


ATTTGAAAGA 


ACTTGCAGTA 


CATCGTTCAA 


CAGTCCTGTA 


CGGTTGAGAC 


CGTAGATATC 


540 


GATATGGGCC 


ATATACTCCT 


TATTTGAG 








568 



(2) INFORMATION FOR SEQ ID NO: 89: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 



60 



CTTTATATTG TACCAAGTAT CATTTCTAGT GCTGCTATGG GCTTGATTTT TCTTCAAATC 

TATAATCCAA ACTATGGTGT TGTTAACCAA ATTATTCATC TATTTAATCC ATCGTTTAAA 120 

GATTCAGTAC TGTTGACTCC CAGGATTAAA AATAGTAGCT ATGACTGGCG CTTATATCTT 180 

CTTTGCAGGA GCATCAACCA TTATGATTTC TTGGGCAAAT TTTTGCTATT CCAGAAGAAG 240 

TTCAAGAAGC TGCTATTTTA GACAATATTA CTGGTTGGAG GAAAGAGTGG TATATTACGA 300 

TTCCGATGAT TAAGGGGACA ATTAAAACTG TTTCAATTAT GGCAGCAACT TCAGGATTTT 360 

TGCTCTATAA CGAAGTATTC TTTTTGACAA ATGGTGCTGC AGGAACAAAA AGTATCAGTT 420 

TTGTTATTCG AGAATTAGCA GTGGCTAGCT CACGAACTCA GTATGCTCGT GCAAATACAA 480 

TTGGAGTTAT ACAAATCTTA GGTGGAATGT TGATTATCGT TTGTATTAAT ATTTTATTCA 540 

GAGAAAGAAA AAGACTGAAA GGTGGGAAAT GATTATGAAT ACACATATAA ATGGTATTAG 600 

TAAAAAAGGC AAAGTTCTTA TATATGGTTA TATGCTCCTT ACCATTTTAA TTTCTATTTT 660 

CCCTATTGCG TGGATTTTTT TATCATCATT AAAAGCAGAT CCTATGAAAA ATCCAGGTAT 720 

TAGTTTACCG ACTGACTTTA CTCTTGAAGG TTATATAAAT GTTTTTACAA AACTTCATGT 780 
TTTTACTTAC TTTTGGAATA 



800 



(2) INFORMATION FOR SEQ ID NO: 90: 



(i) SEQUENCE CHARAC1 ERISTICS : 
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(A) LENGTH: 686 base pairs 
<B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



CTCAGTATTA 


TCCCTACCAA 


AATGAACGTC 


AGTTAGTGAC 


TCAGGTGGTT 


TGGGAACAAT 


60 


GGGCTTTGGA 


ATTCCAGCAG 


CAATCGGTGC 


TAAAATTGCT 


AACCCAGATA 


AGGAAGTAGT 


120 


CTTGTTTGTT 


GGGGATGGTG 


GTTTCCAAAT 


GACCAACCAG 


GAGTTGGCTA 


TTTTGAATAT 


180 


TTACAAGGTG 


CCAATCAAGG 


TGGTTATGCT 


GAACAATCAT 


TCACTTGGAA 


TGGTTCGCCA 


240 


GTGGCAGGAA 


TCCTTCTATG 


AAGGCAGAAC 


ATCAGAGTCG 


GTCTTTGATA 


CCCTTCCTGA 


300 


TTTCCAATTG 


ATGGCGCAGG 


CTTATGGTAT 


TAAAAACTAT 


AAGTTTGACA 


ATCCTGAGAC 


360 


CTTGGCTCAA 


GACCTTGAAG 


CTACTACTGA 


GGATGTTCCT 


ATGCTAATTG 


AGGTAGATAT 


420 


TTCTCGTAAG 


GAACAGGTGT 


TACCAATGGT 


ACCGGCTGGT 


AAGAGTAATC 


ATGAGATGTT 


480 


GGGGGTGAAG 


TTCCATGCGT 


AGAATGTTAA 


CAGCAAAACT 


ACAAAATCGT 


TCAGGAGTAC 


540 


TCAATCGCTT 


TACAGGTGTC 


CTATCTCGTC 


GTCAGGTTAA 


TATTGAAAGC 


ATCTCTGTTG 


" 600 


GAGCAACAGA 


AGATCCGAAT 


GTATCGCGTA 


TCACTATTAT 


CATTGATGTT 


GCTTCTCATG 


660 


ATGAAGTGGA 


GCAAATCATC 


AAACAG 








686 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 91: 



TTTTCCCAGC 


TATTTTACTG 


AGTATAAAAG 


CCTTATTTAA 


CTTACTCTAT 


GTACTCGGTT 


60 


TTCTAGGAGG 


AATGTTGGGA 


GTTGGGATTG 


CTTTGGGGTA 


CGGAGTGGCC 


TTATTTGACA 


120 


AGGTTCGGGT 


GCCTCAAACA 


GAAGAATTGG 


TGAATCAGGT 


CAAGGACATC 


TCTTCTATTT 


180 


CACAGATTAC 


CTATTCGGAC 


GGGACGGTGA 


TTGCTTCCAT 


AGAGAGTGAT 


TTGTTGCGCA 


240 


CTTCTATCTC 


ATCTGAGCAA 


ATTTCGGAAA 


ATCTGAAGAA 


GGCTATCATT 


GCGACAGAAG 


300 


ATGAACACTT 


TAAAGAACAT 


AAGGGTGTAC 


TACCCAAGGC 


GGTGATTCGT 


GCGACCTTGG 


360 


GGAAATTTGT 


AGGTTTGGGT 


TCCTCTAGTG 


GGGGTTCAAC 


CTTGACCCAG 


CAACTAATTA 


420 


AACAGCAGGT 


GGTTGGGGAT 


GCGCCGACCT 


TGGCTCGTAA 


GGCGGCAGAA 


ATTGTGGATG 


480 


CTCTTGCCTT 


GGAACGCGCC 


ATGAATAAAG 


ATGAGATTTT 


AACGACCTAT 


CTCAATGTGG 


540 


CTCCCTTTGG 


ATCGAAATAA 


TAAGGGACAG 


AATATTGCAG 


GGGCTCGGCA 


AGCAGCTGAG 


600 
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GGAATTTTCG GTGTAGATGC CAGTCAGTTG ACTGTTCCTC AAGCAGCATT TTTAGCAGGA 660 

CTTCCACAGA GTCCCATTAC TTACTCTCCT TATGAAAATA CTGGGGAATT GAAGAATTGA 720 

TGAAGACCTA GAAATTGGCT TAAGACGGGC TAAGGCATTC TTTACAGTAT GTATCCTACA 780 

GGTGCATTAA GCAAAGACAA TT 802 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 



CTACTATTTT 


AGCATAAAAA 


TGCCCAAAGG 


GGGNGCCGTG 


TGTTTACTGA 


TTTTCAGGNT 


60 


AATGGACCAG 


GNAAATCAGC 


ATGAAAATAA 


AAAGAGAAAC 


AGATTATTTT 


AGCCATTTGT 


120 


CAGATTTATG 


CTATGCTTAA 


GGTAGAAAAT 


GAAAGGGATA 


ACAAATGTAT 


TTAGGAGATT 


180 


TGATGGAGAA 


AGCCGAGTGT 


GGTCAATTTT 


CAATCCTTTC 


CTTTCTATTA 


CAAGAGTCTC 


240 


AGACGACCGT 


CAAGGCTGTA 


ATGGAAGAAA 


CAGGATTTTC 


AAAAGCAACC 


CTAACCAAAT 


300 


ATGTCACCCT 


GCTCAATGAC 


AAGGCTTTGG 


ATAGTGGCTT 


AGAACTGACT 


ATTCACTCAG 


360 


AAGATGAAAA 


TCTGCGTCTG 


TCTATAGGTG 


CAGCTACTAA 


GGGGAGAGAT 


ATTCCGGAGC 


420 


TTGTTTTTTG 


GATAGTGCTG 


TTAAATACCA 


GATTTTGGTT 


TATCTTCTCT 


ACCACCAACA 


480 


GTTTTTAGCC 


CATCAGCTGG 


CTCAAGAATT 


GGTGATTAGC 


GAGGCTACGC 


TTGGTCGTCA 


540 


CTTAG 












545 



(2) INFORMATION FOR SEQ ID NO: 93: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 57 6 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

GA.GGTCAAGA TTGATTCCCA AGGTTNG AGG AATCTGCGGA ACCAGATTTA GAAGAAATCG 60 

AGCGGTTACC ATGCTTGGCC ATGTGAATAC CGCCACCAGC CAAGACfcAAG GCTGCAGTTG 120 

TGGAAATATT AAAACTGAAA GACTTGTCCC CACCTGTACC ACAGTTGTCC ATAGCATCAT 180 

GAATCTCAGT TGGAATATGC TGGGC ATG'TC CTCTCATGAC TTGGGCAATG GCTGTGCGTT 240 
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CTTCAGGTGT TTCCCCCTTC ATCTTAAGAG CTAAGAGGAG AGAAGCAATC TGCGCTTCAG 300 

TTACACGCCC AGTTACGATA CGCTCAATGA CATCCGTCAT TTCCACACCT GATAAATTTT 360 

CAAATTTTGC TAGTTTTTCA ATAATCTCTT TCATCCTAGT TTCCTCACTT TACAACCTCC 420 

TCGATAAAAT TCCGAATAGA AGACAAGCCG TCTGGCGTTC CAATGCTCTC TGGATGGTAC 480 

TGGAAGCCAT AAATCGGTAG GTTTTTATGT TGAATCCCCA TGATGGCTTG GTCATCAGTC 540 

GAACGAGATC AAGCTTATCG ATACCGTNGA CCTCGA 576 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 



ATTAAAGGGA 


GGCGAACATG 


GCCCAAGATA 


TAAAAAATGG 


AGAAGTAGAA 


GAAGTTCAAG 


60 


AAGAGGAAGT 


TGTGGAAACA 


GCTGAAGAAA 


CAACTCCTGA 


AAAGTCTGAG 


TTGGACTTGG 


120 


CAAATGAACG 


TGCAGATGAG 


TTCGAAAACA 


AATATTCTTC 


GCGCTCATGC 


AGAAATGCAA 


180 


AATATCCAAC 


GCCGTGCCAA 


TGAAGAACGT 


CAAAACTTGC 


AACGTTATCG 


TAGCCAGGAC 


240 


TTGGCAAAAG 


CAATCTTACC 


ATCTCTTGAC 


AACCTTGAGC 


GTGCACTTGC 


AGTTGAAGGT 


300 


TTGACAGATG 


ATGTGAAGAA 


GGGCTTGGGG 


ATGGTGCAAG 


AAAGCTTGAT 


TCACGCTTTG 


360 


AAAGAAGAAG 


GAATTGAAGA 


AATCGCAGCA 


GATGGCGAAT 


TTGACCATAA 


CTACCATATG 


420 


GCCATCCAAA 


CTCTCCCAGC 


AGACGATGAC 


CACCCAGTAG 


ATACCATCGC 


CCAAGTCTTT 


480 


CAAAAAGGCT 


ACAAACTCCA 


TGACCGCATC 


CTACGCCCAG 


CAATGGTAGT 


GGTGTATAAC 


540 


TAAGATACAA 


AGCCCGTAAA 


AAGCTCGCAG 


TAAAAATAGG 


AGATTGACGA 


AGTGTTCGAT 


600 


GAACACAAGA 


AAATCTANCT 


TTTTTACTCA 


GAGCTTAGGG 


CGTGTTCGAT 


TCGGCAATTC 


660 


TGACGGTAG 












669 



(2) INFORMATION FOR SEQ ID NO: 95: 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xii SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
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CTAGTTCACC 


ATTTCTATTT 


H & a n" M TVV2/^P p P 
Annl IVoOV. X 


PAAPTAPAPP 


R A A AT ATP AT 


ATAAGGTAAA 

^\ x x ****** 


60 


GTTCCCACTA 


» /~»» ariv^'ppftp 
AuAA IV i UAu 


AAivnnnl IV 


TTPAPTAAPT 

X 1 LAW x /U\W A 


TCAGCGGTCA 


AACTTAGCTT 

now x x nu\t x x 


120 


ATCCAA i 1 1 V 


nr^APPfiATA & 


appp AT APTT 


CAGAGATACT 


TCCTCTTGAA 


ATTTAGAGTA 


180 


ACTCTTATTA 


n & P A T f PTY2 & 1* 


GTATPGTTAT 


CGGGCTATTA 


GAAGGAGCCC 


GATTTAAAAC 


240 




ATY5A AATTTC 

A luAAA i I 


TCCTTCAAAA 


AACAAAACAT 


TCTATATACT 


TAAATTTTAG 


300 


GGCAAGACTG 


CCGVL TV-V-AL. 1 


n a pp aprptp 


PAPTTGPPAG 


CAAAGCTGGC 


AGGTC C AGCC 

AUVI llaU AUVa V» 


360 


TTACCCTAAA 


TATAAATTAT 


TTAATTTTTG 


CCATTTCAAT 


CAATTCTGCA 


AAGGTCCCAA 


420 


GCTTTCTGGA 


TTGAATTACC 


TCATATTGGG 


CATTCTGTCC 


AAATGAAGCA 


TTATTAATCA 


480 


CTTCAACCTC 


ATAAAGGCCT 


GGAGCAGAGA 


AAACAGCTCC 


AAATTTGGAT 


TTATCGAAAT 


540 


GCATACCTAG 


GTCACTCCAA 


CAATCATTAA 


GGGTGGAGAC 


TTTTATCCAG 


CCTCGTTCTT 


600 


TCTTAGTAGA 


AATAG 










615 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 773 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 



CTACTGTAGT 


TGGAACTGTT 


GAAAAACAAG 


GAAAACAAAA 


GAAAGTTGTT 


ACTTACAAGT 


60 


ACAAACCTAA 


AAAAGGTAGC 


CACCGTAAAC 


AAGGTCACCG 


TCAACCATAT 


ACAAAAGTTG 


120 


TCATCAACGC 


AATTAACGCT 


TAATTTTAAG 


GAGAACACAT GATACAAGCA GTCTTTGAGA 


180 


GAGCCGAAGA 


TGGCGAGCTG 


AGGAGTGCGG 


AAATTACTGG 


ACACGCCGAG 


AGTGGCGAAT 


240 


ACGGCTTAGA 


TGTCGTGTGT 


GCATCGGTTT 


CTACGCTTGC 


CATTAACTTT 


ATCAATTCTA 


300 


TTGAGAAATT 


TGCAGGCTAT 


GAACCAATCC 


TAGAATTAAA 


CGAAGATGAA 


GGTGGCTATC 


360 


TGATGGTTGA 


AATACCAAAA 


GATCTTCCTT 


CACACCAGAG 


AGAAATGACC 


CAGTTATTCT 


420 


TTGAATCATT 


TTTCTTAGGT 


ATGGCAAACT 


TATCGGAGAA 


CTCTTCTGAG 


TTCGTCCAAA 


480 


CCAGAGTTAT 


CACAGAAAAC 


TAACACGGAG 


GAAAACATTA 


TGTTAAAAAT 


GACTCTTAAC 


540 


AACTTGCAAC 


TTTTCGCCCA 


CAAAAAAGGT 


GGAGGTTCTA 


CATCAAACGG 


ACGTGATTCA 


600 


CAAGCGAAAC 


GTCTTGGAGC 


TAAAGCAGCT 


GACGGACAAA 


CTGTAACAGG 


TGGATCAATC 


660 


CTTTACCGTC 


AACGTGGTAC 


ACACATCTAT 


CCAGGTGTAA 


ACGTTGGTCG 


TGGTGGAGAC 


720 


GATACCTTGT 


TCGCTAAAGT 


TGAAGGCGTA 


GTACGCTTTG 


AACGTAAAGG 


ACG 


773 



(2) INFORMATION FOR SEQ ID NO: 97: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
713 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

CTGGATAATT GGAGCAATTC CTGATGCCAG TTTGAGGTTA CTGATAGGAT TGTGGGCGAT 
AGCCACTTGA GAAGATGCCA AGAGTTCAAT TTCTCTCTCG TTTAATTCGA CCCCGTGAGC 
AAATACGGAC GGATGATCTA AATAACCCAG TTCTTCAAGA AAAGCAAGGG GGCGTTTGCC 
GTACCGTTTG AGGATAATTC CTGACTCCTC CTTGGTCTCC GCCACATGGA CATCGAGCGG 
AATATTTAGC TCTTTTGCCA TTTCCAAACT CGCTTCCAGC AAGTCTCTAC TGCAGCTATA 
CGGAGAATGA GGTGCTACCA TAACCCTTGA AATTTGGGAT TTTTATATTT TAAGATTTCG 
TCTATGATGG ATCGAGTTCT GCTTATAGTC TCAGCAGTTG TTTCTGTCTC TGAAGAAAAG 
AGAGTCGGAG AAAAATAACA ACGCATCTTG GAAGTTTTCA CCACCTGATA AATTTGCTGG 
ATATCCACAC CATTGGGATT ATACATATCG TTAAAGGTTG TTGTTCCTGA CTGGAGCATC 
TCTGTTAGGG CTTCTTTGAC CGCATTGGTA GTCATGTCGG GAGTAAACTC AGATTCTGCT 
GGCCAGATAT AGTCATTGAG CCATTTCATG GAGATTGCTG TCATCTCGGA TCCCTCTCAG 
ACCTGTCATT GCAGAATGGG TGTGACAATT GACCAAACCA GGCATAAGCC AAG 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

CTAGAAAATC TGCTTTTTTC TCGAAAAGTT TAATACTTGG TACAATATTA CGCTTGCTTT t>u 

GCTCGGCTGC TCCAAGGGCA ATTTTTTCTA GTTTTTCAAC TTTTTTACCC AATTTCTTGC 120 

CATCCCACTT GGCAACTGAC CAGTCTGCAG GAAAGGCCCA GATTTGGCTA GCACCCAGTT 180 

CGGTTACTTT TTGAGCGATG AACTCCAGCT TGTCTCCCTT GGGAAATCCA GATGCGATGG 240 

TCACTTGGAC TGGTAGTTCC ACATTGTCAT TTAATTCTTG GACCAACTCA AACTGACGAT 300 

TTTCCATATC CAGCACGCGC GCCAAGCGCT TGATGCCATC ATCAAAGACT AAGGTAACCT 3 60 

CATCCTCTTC TTTCAAGCGC ATAACCTGAA ACATATGCTT ACTGGTTTCC TTGTCCTCGA 420 

TAGTGACAGG AGAGATAGCA CTGCCTTTTA CAAAATACTG CTGCATGCTA GCCTCCAATC 480 

ACACCAGAGA TATCCTTGGT TTTCTTAAAG ACACAGGTAT TCCATTCCCC TTGAACCATG 540 

TGAGTTTCGA GGAAAAATCC AGCTGATTCA GCCGACTCGC GCACCATGTC CCACTTATGC 600 

CTTGAATAAT GCCACTCATG ATCAGGTAGC CTTCGTCCTT AACCAAACGA TAGGCATCGT 660 

CTATTAGATG AATGAGGATA TCCGCCAAGA TATTAGCCAC AATCACATCT GCCTCAATTT 720 

CCACACCCTT AAGCAAATCT CCAGCCGCTA CATGGATATT TTCCATGCCA GGGTTGAG 778 
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(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



CTTCCCAGAT 


TTTTTCGTTG 


ACTTGCTGTT 


CAATCGCACG 


CAATTCTTCA 


GCAGTTACAG 


60 


CTTGGAAGTG 


GGTAAAGTCA 


AAGCGAAGGA 


ATTCAACTTC 


GTTAAGAGAT 


CCTGCCTGTG 


120 


TTGCGTGGTT 


TCCAAGGATA 


TTGTGAAGGG 


CAGCGTGAAG 


CAAATGAGTC 


GC AGTGTGGT 


180 


TTTTCATGAC 


ACGGTGACGG 


CGATTGCTAT 


CAATTGCCAA 


GGTATATTCT 


TGGTTCAAGG 


240 


CAAGCGGTGC 


AAGGACTTCA 


ACTGTATGAA 


GGGCTTGACC 


ATTTGGGGCT 


TTCTGAACAT 


300 


TGGTCACAGT 


AGCCACAACC 


TTACCTGACT 


CATCCAAGAT 


TTGTCCGTAG 


TCAGCTACCT 


360 


GTCCACCCAT 


TTCAGCATAA 


AATGACGTTT 


CCCGCAAAGA 


TAAGAGAGGC 


AGTTCCTTCT 


420 


GAAACAGCTT 


CTACTTCTGC 


ATTGTCCGCC 


ACGAATAGCT 


ACCAATTTAG 


AAGACAATTG 


480 


GCTAGCATTG 


TAGTTGAAGG 


CACTTTCTAC 


AGTGATGTTT 


TGAAGAGTTT 


CCATTTTGCA 


540 


TACCCATTGA 


GCCACCCTTG 


ACAGCTGACG 


CACGCGCGCG 


TTCTTGTTGT 


TCTTTCATGG 


600 


CTGCTTCAAA 


ACCTTCACGG 


TCTACAGTCA 


TACCAGCTTC 


TTCAGCGATT 


TCTTCAGTCA 


660 


ATTCAACTGG 


GAACCCATAA 


GTATCATAGA 


GTTTGAAGAC 


ATCTGAACCA 


GCGATAACAG 


720 


ATTGACCTTT 


TTCTTTCAAG 


TCTGCTACAA 


TGCCTTGGGC 


AAAGTGTTGA 


CCTTGAGTGA 


780 


AGGGTACGGG 


CAAATGATTC 


TTCTTCGCCT 


CTTAACGATT 


TTCTCAATAA 


AGTCACGTTT 


840 


CTCAAGCACT 


TCTGGGTAGT 


AGCTTTCCAT 


GATTTTTCCA 


ACAGTTGGAA 


CGAGTTTTGT 


900 


AAAGGAAAGG 


CTCGTTGATA 


CCCAATTTTT 


GACCCATGCA 


TAGAAGCACG 


ACGGAGAAGA 


960 


CGACGAAAGG 


ACATAACCAC 


GACCCTCCAT 


TTCCTGGAAG 


GGCACCATCC 


ACCGATGGCA 


1020 


AATGAAAGTG 


AACGGATGTT 


GGTCCGCGAA 


TGAACCTTGA 


AT 




1062 



(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 865 base pairs 
!B) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
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CTCAAAGGTC GCACCTGCAA TACGACCCGC 
ACTATTGCCT ACATAACTAC CAAAGTCATC 
AGCCTTGTCA GGTGCGACAT AGCGCAAGAT 
GTAGCCACCG TCTGTCTCAA ATCGATAGGC 
TACACGCTCT GTGTATGCTT TCATTCTGTT 
ACCATAGAAA GCGTACTGAC AATCTATGGT 
AGAATTCATA AACACTGTCT TACTTTTGAT 
GGTTTACTGT ACCTGTAGGA GACGGTGTAT 
ATAGGGCAAA GAGTTGGGCA TAAACGATGT 
CACCGCCACA ACCAAGGGCC ACTTCTTTGA 
AAAGCACAAC ACGACGAGCA ATCTGGTCAC 
ACTTACGAGT GTAGTCCGTC CTTGTACCAA 
GATATTTGGA CCGTGACGGA ACCAACTGGG 
GTTAATCCCA AATCTTGAGC TGAGCCTCAT 
GAGACCGGTT AAGTCTTAAT CACAG 

(2) INFORMATION FOR SEQ 
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TACAGGACCT ACACTTGCTC CATGCTTGGG 60 

AAATCCCAAG ATAACATTGG CAAAATTTCC 120 

AGTCGCACCA TAAGTCATAA CCTCAAGTTG 180 

CAAGACATCC TCACCCTCAA CATTTCCAAA 240 

CTCCTTTTAC TATTTCTCTC AAGCAAACAA 300 

TTATCTGATA ATTTACAAAT CCTCTTGTCA 360 

ATTCGTGAAT TATGACACCT TGTACTACAC 420 

CTGGTTTATT TTCTACCTTG AGTGAAGTCA 480 

AAGGGAAGAC ACGGTAAATA TCATTCAAGA 540 

CATTTTCAAG ACCAAAAGCT TGATCACTCA 600 

CAGCAACTTC ACGAACCAAG TCCAAGTCGT 660 

AGAACAAAAC AACTGTATTG TCGTTGATAA 720 

CTTTCATACA TGGTCGCAAC TTGAACAACA 780 

GAGCAGTCCA AAGAAGACCA GCGCCTAGAA 840 

865 

ID NO: 101: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 946 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 



TTTTGCTCCA GTTGATAAGA AAGGCTTGGT CGATGTTGAG GCGTTAGCAG GTTTGATACG 60 

GCCTGATACA ATCCTCGTTT CCCATCATGG CTGTGAACAA TGAAATCGGC TCTATCCAAC 120 

CNATTGAGGC TATTTCAGAA TTCTTGGCAG ACAAGCCGAC TATTTCCTTC CACGTTGATG 180 

CGGTTCAGGC GCTTGCCAAA ATTCCGACTG AAAAGTATCT GACAGAACGG GTGGATTGCG 240 

CGACCTTCTC GAGTCATAAA TTTCATGGTG TCCGTGGTGT TGGCTTTGTC TATATCAAAT 300 

CTGGCAAGAA GATTACACCT CTTCTTACAG GTGGTGGCCA GGAGCGAGAT TATCGTTCGA 360 

CAACTGAAAA TGTGGCAGGG ATTGCAGCGA CAGCCAAGGC TCTCCGTTTA TCTATGGAAA 420 

AGCTAGATAT CTTTAGGAGC AAGACTGGGC AGATGAAGGC AGTGATTCAC CAAGCTCTTC 480 

TGAACTATCC GGATATTTTT GTCTTTTCAG ATGAGGAAGA CTTTGCACCT CATATTCTGA 540 

CTTTTGGAAT CAAAGGTGTT CGAGGTGAAG TCATCGTTCA CGCCTTTGAA GACTATGATA 600 

TTTTCATCTC AACAACCTCA GCTTGTTCAT CTAAGGCAGG AAAACCAGCC GGTACCTTGA 660 

TTGCCATGGG AGTGGACAAA GATAAGGCCA AGTCAGCTGT GCGTCTTAGC CTAGACTTGG 720 

AAAATGATAT GAGTCAGGTC GAGCAG7TTT TCACCAAGTT AAAATTGATT TACAATCAAA 7 B0 

CTAGAAAAGT AAGATAGGAG CATTCATGCA GTATTCAGAA ATTATGATTC GCTACGGAGA 840 

GTTGTCAACC AAGGGTTAAA AACCGTATGC GTTTCATCAA TAAACTTCGT AATAATATTT 900 

CGGAOCTTTT GTCTATCTAT ACCCAAGTTA AGGTAACAGC AGATCG o 46 

119 



WO 97/37026 



PCT/US97/05306 



(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 552 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 



CTTGCTCGTA 


ACAGGTGCTA 


TCCTTGGTGT 


CAATGTTCAC 


ATCTTCTCAC 


CAAAAGAACT 


60 


CTTCCCAGAA 


AAAGAAATCG 


TTGAATTGGC 


AGAAGGATTT 


GCTAAAGAAA 


GTGGCGCACA 


120 


TGTTCTCATC 


ACTGAAGATG 


CTGATGAAGC 


AGTTAAAGAT 


GCAGACGTTC 


TTTACACAGA 


180 


CGTTTGGGTA 


TCAATGGGTG 


AAGAAGACAA 


ATTCGCAGAA 


CGTGTAGCTC 


TTCTTAAACC 


240 


TTACCAAGTC 


AATATGGACT 


TAGTTAAAAA 


AGCAGGCAAT 


GAAAACTTGA 


TCTTCCTACA 


300 


CTGCTTGCCA 


GCATTCCACG 


ATACTCACAC 


TGTTTATGGT 


AAAGACGTTG 


CTGAAAAATT 


360 


TGGTGTAGAA 


GAAATGGAAG 


TAACAGACGA 


AGTCTTCCGC 


AGCAAGTACG 


CTCGCCACTT 


420 


CGATCAAGCA 


GAAAACCGTA 


TGCACACTAT 


CAAAGCTGTT 


ATGGCTGCTA 


CACTTGGTAA 


480 


CCTTTATATT 


CCTAAAGTAT 


AATTTTAGAT 


AATAAACCGT 


CTACCAACAG 


CTATGAGGGC 


540 


TGCGACTAAT 


AG 










552 



(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID KO:103: 



CTAAAAAAAG 


TCAAGTAGAA 


AACGAATATT 


CTACTTAACT 


TGCACGAAAT 


TATTTTTCAC 


60 


GAATGACTTC 


GACCTTATAT 


CCATCAGGGT 


CTTTGACAAA 


GTAATAGTTT 


GGTGCAGTTC 


120 


CTGGTAGACC 


ATTTGGCTCA 


GTCACTTCAT 


AGCCTTTTGT 


ACTGTGCTCT 


TGATGAAGTG 


130 


CCTCAAGATC 


AGGTGTACTG 


AGGGCGATAT 


GGGCAAACCC 


ATCACCAACC 


ACATACGGAC 


240 


CGTGATCGTA 


G7TATAAGTC 


AACTCCAAC? 


CATAGTCATC 


ACCCTCAAGA 


CCTAGATAGA 


300 


CAATCGTGAA 


GGCATGGTCT 


GGAAAATCTC 


TGCGACGCAA 


TTCTTTAAAA 


CCAAAAGCAT 


360 


CTTGATAAAA 


TGCAATTGAT 


TTTTCAAGAT 


TTTCTACTCG 


TAAGCAAGTG 


TGTAGCATTT 


420 


TTGAAGCCAT 


ATCTTTCTCC 


TrTATlTTTA AAAAGACTGG ACAATCCTGT 

120 


TCCAGTCTCA 


480 
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TCAGTTGTTA TTTACCAAGT TTTGCTTTAG CT 512 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: 

CTGGACCACA CTATTTTCTG TGTTGGCTAT CGTGTCATGC AAAAGGATCT AGAAGGGACG 60 
CTGGATGCTG AAAAACTCAA GGCTGCTGGT GTTCCGTTCG GCCCGCTTTT TGGTAAAATC . 120 

AAAAACGGCC AGGATCTTGT TTTGGAAGAC GGAACTGAAA TCAAGGCAGC AGACTATATC 180 

TCAGCGCCAC GTCCAGGTAA GATTATCACT ATTTTAGGAG ACACTCGAAA AACGGATGCC 240 

AGTGTGCGTC TGGCTGTCAA TGCAGATGTC CTAGTTCATG AGTCCACTTA TGGCAAGGGT 300 

GATGAAAAAA TTGCTCGTAA CCATGGTCAC TCAACTAATA TGCAAGCTGC ACAAGTAGCG 360 

GTAGAAGCAG GTGCCAAACG CCTCCTACTC AACCATATCA GTGCCCGTTT CCTCTCAAAA 420 

GATATAAGCA AACTCAAGAA GGACGCTGCC ACAATTTTTG AAAATGTCCA TGTGGTCAAA 480 

GACTTGGAAA AAATGGAAAT CTACCAGTCA CAGAAAGGAT AAGTATGCCT ACTATTCTCC 540 

ATTAACCGGA A 551 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

CCGTCATTAT ACCTTTGTCT ATGAAAATGA AGACCTAGTC TATGAGGAGG AAGTCTTATG 60 

ATACAGCCAG CAAGTTTAGA AGAATTAGCA TCTTTAGTGG AAAAAGCGGG CAAGAAGGTC 120 

TTCCTTTTTG TGGCAGACTG GTGTGGCGAT TGTCGTTATA TTTATCCTGC CTTACCAGAG 180 

ATTGAGGAGA CCAATCCAGA GTTCACCTTT ATTCGAATGG ACCGAGATCA GTATATGGAT 240 

TTGGCCAAAC TCTGGGATGT TTACGGAATT CCTAGCCTTG TTGTTCTAGA AAAGGACAAG 300 

GAAATTGGTC GTTTTGTCAA TCGCGACCGT AAAAGTAAGG AGCAAATTAA CGATTTTTTA 360 

GCAGGATTGA AATAGGAGAA AAAGGAAACA ATGATTTTTA CGTATAACAA AGAACATGTC 420 
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GGTGATGTCC TTATGGTCAT CGTGAAAAAT AGCGGAGATG CCAAACTGAA TGTGGAACGC 480 
AAAGGCAAGG TAGCCCGTGT TTTCCTCAAA GAAAATGGGG AAACAGTAG 529 

(2) INFORMATION FOR SEQ ID NO: 106; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

CTAGGATAGA CCGCTTTTTA GCATTTATCT AAGCATTCCA GAGTACATGT ATCTTGCATG 
TGCTCTTTCT TTTGGGGTTG AAACGATAGG AGAAGGAAAT GTTAGAATTG CTTAAATCAA 
TCGATGCTTT TGCTTGGGGA CCGCCCCTCT TGATTTATTG GTCGGAACAG GGATTTACCT 
AACCATGCGG CTAGGACTCT TGCAGGTTTT GCGTCTGCCC AAGGCCTTTC AGCTTATTTT 
TATCCAGGAT AAGGGACATG GTGATGTATC CAGTTTTACA GCTTCTTGTG TAACAGGCCC 
TTGGGCATTC AAACTGGTTG GGAAACAGGG AAATATCCAT TAGGGAGTTG GCGACGGCTA 
TCAAGGTTGG TGGACCAGGA GCTCTATTTT GGATGTGGAT GGCGGCTTTC TTTGGAATGG 
CTACCAAGTA TGCGGAAGGA CTCTTGGCCA TCAAATACCG CACCAAGGAC GACCATGGTG 
CAGTAGCGGG AGGTCCCATG CATTATATCC TTCTAGGGAT GGGAGAAAAG TGGCGACCAC 
TTGCTGTTTT GTTTGCAGTA GCAGGAGTAT TGGTTGCTCT CTTGGGAATC GGAACCTTCA 
CCCAAGTCAA CTCGATTACA GAATCTATCC AAAATACAAC GACGATTTCG CCAGCCATCA 
CAG 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

CCTTGACCAC CAATTCTTCA CCCGATATGA TGCGCCCTGC TCGCTGAGCA AGCCCCAAGA 
GATTACTTAT CTTTTGCTTA TTCAAGTCCC AACTCTCTTC TTTTCACTTT GTGATCCACA 
TAAGCGATCA ACTCGTCATA AAAGCTTTCT TCCACTTCCA TGCTAAAGCT GCGGTTAAAG 
ACCTTCTTCT TTTTCGCCTC TAGGCCTTCT G^.TTCTCTA GTTTGATATA AGCGCCGCGG 
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CCATTGGCCT 


TGCCCGTAGG 


ATCAATAAAG 


ACTTGTCCTT 


CCTTGTTCTT 


GACAATGCGG 


300 


AGCAAATCAC 


GCTTATCAAT 


V-At. 1 1 A 


<jAV»AU AAC AVj 


AL I i bvbLAA 


AGGGATTTTT 


3 ou 


CTTGTTTTCA 


TCTTTCCCTC 


1 v— l iivx, not 


ill Inl iLl i 




Oil i Iv. 1 AL. 1 




TCCAACTCTA 


CTGAAGCAGC 


GTCTTCCATG 


GCTTCAAATT 


CGCTAGCAGA 


CTTGATATCG 


480 


ATACGGTAAC 


CAGTCAAGTG 


AGCCGCCAAG 


CGCACGTTTT 


GTCCACGACG 


ACCAATGGCA 


540 


AGAGAAAGCT 


TGTTATCTGG 


AACAACCACC 


AAGGCACGTT 


TGCTGTCGTT 


TTCATCAAAG 


600 


ATAACTTGGT 


CAACCTCAGC 


AGGAGCGATG 


GCATTGTAGA 


TAAATTCAGC 


TGGATCTGCT 


660 


ACCCACTCGA 


TAACATCGAT 


ATTTTCTTCG 


ATTGGTACCA 


TGCGGTCATT 


TTTAGCATCG 


720 


TAACGAG 












727 



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 



CTAATGTAAA 


TCTATGGGAT 


AAACGTTAGC 


TTCTTTTAAT 


AGATTATTAA 


GCAATTTTTT 


60 


AAAACAACTC 


ATCAAACAAA 


CTCAACTGGT 


TATCTCTGGC 


ATATTTCCAA 


GAATACCCAT 


120 


CTCATCCATC 


TTTTCAACCA 


AGGTTGATGA 


GAGTCCACCA 


CGCTTGCGTA 


GTTCTGTTTT 


180 


AGAGAGGAAT 


TCTCCCTCTT 


CACGCGCCCG 


CACCAGTTGC 


TTGGCAACGT 


TCTCCCCCAG 


240 


ACCATCCATT 


GCTACAAATG 


GTGGGATAAG 


GGTATCCCCG 


TCGATGAGGA 


ACTCTGTCGT 


300 


CTGACTACAG 


TAGAGATCTA 


ATTTACCAAA 


CTTGAAACCT 


CGTTCCCACA 


TCTCATTGAC 


360 


AATCTCAAGA 


GTTGTATAGA 


GATCGATTTC 


CACATTAGAG 


GCTTCATTGT 


TCTTCCGTTT 


420 


TTCAGAGATT 


TCTTCCATTC 


TGCGCTTGAT 


GACCTCCAAG 


CCCGCACCCA 


TGGTCTTGAT 


480 


ATCAAAAGCC 


TTAGCACGAA 


TGGAGAAGTA 


AGCACAGTAG 


TAATAAATAG 


GATGGTGAAC 


540 


CTTGAAGTAA 


GCTACACGCA 


AGG 








563 



(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 647 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
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C TAACTTCAA 


AAGCTTTTAG 


AATATCTTCT 


TGtACTAATCA 


TATGATTACC 


GCCCAAGGTT 


60 


AAGTTTTCCA 


AAATAGAGCC 


ATTAAAGATA 


TAGGCTTG i J. 


GGGGTAGGTA ATTAATATGA 


120 


CGGCGCAAGA 


CTTTTTTTAT 


CAATGTTTTT 


AATATCCTGA 


TGATTGATGG 


AAATATGCCC 


180 


TTTGTAGGGT 


TCAAAGAAAT 


TGACAATCAT 


TTTGGCTAAA 


GTTGTTTTAC 


CAGAACCACT 


240 


AACTCCAACT 


AGGCTAACCT 


TATCTCCTTG 


TTTAATCGTG 


AGATTAATAT 


CTGTTAAGGT 


300 


ATCTCGTCCA 


AAACCATACT 


TATAAGAAAG 


GTCATCAAAT 


TCAATATCGC 


CCATCAAAAA 


360 


ATGTGAATGA 


ACAGGGTTTT 


CTTGAACTTG 


AAATTCAGAT 


TCGACTAGAT 


AGACTTCGTT 


420 


CAAACGGTTA 


TTAGCGACCT 


TCGCAGATTG 


GAGTTTGGTT 


TGGAGGTTGA 


TAATATTTTC 


480 


CATAGGAGTT 


GTAAAGTAAG 


AAAGAAGTGT 


GTTAAAGGTA 


ATCAGCTGAC 


CGATAGAAAT 


540 


TTTACTCGAC 


ATGACTAATT 


GAGCGCCAAA 


CCATAGGATA 


AGGATATTCA 


GAACTAATTT 


600 


TGTTTCCCCT 


GCTTTAAACT 


CGTTTGTAAA 


ATAGAATATT 


TACTGAG 




647 



(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 



CTATCCTNGA 


ACCAATGATG 


CTTGTAACAA 


TCACTGTTCC 


AGAAGAAAAC 


CTTGGTGATG 


60 


TTATGGGTCA 


CGTAACTGCT 


CGTCGTGGAC 


GTGTAGATGG 


TATGGAAGCA 


CACGGTAACA 


120 


GCCAAATCGT 


TCGTGCTTAC 


GTTCCACTTG 


CTGAAATGTT 


CGGTTACGCA 


ACAGTTCTTC 


180 


GTTCTGCATC 


TCAAGGACGT 


GGTACATTCA 


TGATGGTATT 


TGACCACTAC 


GAAGATGTAC 


240 


CTAAGTCAGT 


ACAAGAAGAA 


ATTATTAAGA 


AAAATAAAGG 


TGAAGACTAA 


TCCGTCCTCA 


300 


CTCTAGAAGG 


AAGTCACTTA 


GTGGCTTCCT 


TTTGTCTTTA 


GAAAATACCT 


CTAAATATGG 


360 


TAAAATAGTA 


GAAGAATAAT 


GTGAGGAAAA 


TGAATGTCAA 


ATAGTTTTGA 


AATTTTGATG 


420 


AATCAATTGG 


GGATGCCTGC 


TGAAATGAGA 


CAGGCTCCTG 


CTTTAGCACA 


GGCTAATATT 


480 


GAGCGAGTTG 


TGGTTCATAA 


AATTAGTAAG 


GTATGGGAGT 


TTCATTTCCG 


TATTTTCTAA 


540 


TATTTACCTA 


TTGAAATCCT 


TTTTACAATT 


AAAGAAAGTT 


TGAGCGAAGA 


ATTTCCTNAG 


600 


AANGCCATCA 












610 



(2) INFORMATION FOR SEQ ID NO: 111: 

(il SEQUENCE CHARACTERISTT.CS : 
(A) LENGTH: 984 base pairs 
(E) TYPE: nucleic acia 
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(C) STRANDEDNESS : single 
(O) TOPOLOGY: linear 



PCT/US97/05306 



(ii> MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 



TATAAGCATA 


GTACCAGTTG 


CCATTGACCT 


GGAACCAACT 


GTCTTCATAT 


CTCCATTACT 


60 


GCATTTAGGT 


AGTACCAAGT 


TGAACCATCT 


TGATACCAAC 


CAGTTGCCAT 


TGCTCCTGAT 


120 


GAACGGAGAT 


AGTACCATTT 


GTTCCCAAGG 


TTTTGCCAAC 


CTGTTTTCAT 


ATCGCCATTT 


180 


GGCTGGTCTA 


AATAATACCA 


AGTGGTACCT 


TCCTGATACC 


AACCAGTTGC 


CATTGCTCCT 


240 


GATGAACGGA 


GATAGTACCA 


TTTGTTCCCA 


AGGTTTTGCC AACCTGTTTT CATATCGCCA 


300 


TTTGGCTGGT 


CTAAATAATA 


CCAAGTGGTA 


CCTTCCTGAT 


ACCAGCCAGT 


GGCCATTGCT 


360 


CCTGAGGAAC 


GGAGGTAGTA 


CCACTTATTA 


CCTAGATATT 


GCCAACCTGT 


TTGCATAATA 


420 


CCAGTTGTTG 


GATCTAAATA 


GTACCAAGTC 


AAATCATCGT 


TTATCCACCC 


CGCACGTCTT 


. 480 


TCACCACCAA 


GGTAGTTTTC 


TCCATTAATT 


TCCGTCTTAG 


CTAGATAATA 


CCAGTAGACT 


540 


GATCATAAAG 


CCAACCTGTC 


TCTAAAGAAT 


GATTTTGATT 


AAAGTAATAG 


TTCGTATAAT 


600 


AACGNTTCTC 


TTCTTTATCT 


TCTGAATCTT 


CACGTTTTTC 


CCCGTACTTT 


CTTCCAACAC 


'660 


TGTCTTTAGT 


TTTAATCTCT 


AATGTTTTCC 


AACCAACAAA 


CTCTTGTAGC 


ACTCCATTTT 


720 


TATCGAAGTA 


GTACCACTCT 


GAATTTGGAA 


AACCTTCTAA 


TCTCATACCA 


TTTGGGTAAG 


780 


GACCAATTGT 


ACTACCTTTA 


GATGGAAACG 


GGATATATTG 


CCAGCCGACA 


ACCATCTCTC 


840 


CAGATAGAGA 


ATCAAAATAA 


TAGTACTTAC 


CATCAATCAC 


TCGCCAGTAG 


GTTTCTTTGA 


900 


GGTCCCCCTT 


TTTGTAATAG 


GTTCTACCGT 


TTTCTTGGAC 


AAACTGCCAT 


CCTTCAGAGT 


960 


TATCTGCAAA 


TACTGTATTC 


GTAG 








984 



(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 



CCTCGTTATG 


CAGATGAACG 


TTATTTCTTG 


TCAAAGAGTC 


ACAAGAATTT 


TGTTGATCGT 


60 


AATCTTTTTA 


TTACCATTCG 


TGACAAGGAA 


ACCACCTGTA 


TCAAGCCTTA 


TCAGCAGGAT 


120 


TTGGATTTGC 


CACATGGTCT 


GGCCTTGGAT 


GTTTTGCCTT 


TGGATTATTA 


TCCGAAAAAT 


180 


CCAGCTGAGC 


GGAAAAAACN 


GGTTCGTTGA 


GCCTTGATTT 


ATTCACTCTT 


TTGTGCGCAA 


240 


ACTATTCCAG 


AAAAGCATGG 


TGCTCTCATG 


AAATGGGGAA 


GTCGCATTTT 


ACTGGGTTTG 


300 
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ACTCCAAAAT CTCTCCGTTA TCGCATCTGG 
GATTTGGCTG ATTGTGATGG CATTACAGAA 
AAGTACCCAA TCACATCTTT TGAAGACAAT 
CCTATTCCAA TCGGCTATGA TGTCTATCTC 
CCACCAGCAG ACAAGCAGGT ACCGCATCAG 
TATACAGAAT ACAAGGGAGA ATATGGTGGC 
TTTTTCTCTT GGAGGTGGTG CAGAGAAGAT 
AGAAAAGTAT GATATTGATA TTNTTGAAAT 
TCCAAAGCAT GTACGCATTT TAAAATCCCT 
AG 

(2) INFORMATION FOR SEQ 



PCT/US97/05306 

AAAAAAGCTG AGAAAGAAAT GACTAAGTAT 360 

TTATGCTCAG GTCCTGGCTA CATGAGAAAC 420 

CTTTTCTTGC CATTTGAAGG AACAGAGATG 480 

AGAACTGCTT TTGGGGATTA TATGACGCCT 540 

GATGCTGTCA TCGCTGATAT GGATAAGTCT 600 

TAAGAAAAAA ATCTTATTTT TTATGTGGTC 660 

TCTATCAACC ATTGTTTCAA ATCTGGATCC 720 

GGAGCACTTT GACAAGGGAT ATGAATCTGT 780 

TCAAGATTAT CGCCAAACCA GATGGATACG 840 

842 

ID NO: 113: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 749 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:113: 



CTGGCAAATA CAAGGTGACG ATCATTGGTA AATCAGCCCA CGGTGCTATG CCTGCTTCAG 60 
GTGTCAATGG TGCGACTTAC CTAGCCCTCT TCCTTAGCCA GTTTGACTTT GCTGGTCCAG 120 
CCAAAGAATA CCTTGACATC ACTGGTAAAA TTCTCTTGAA CGACCATGAG GGTGAAAGTC 180 
TCAAGATTGC TCATGTGGAT GAAAAGATGG GTGCCCTTTC TATGAATGCA GGCGTCTTCC 240 
GCTTCGATGA AACAAGTGCT GATAATACCA TTGCCCTCAA CATCCGCTAT CCAAAAGGAA 
CAAGTCCAGA ACAAATCAAG TCAATCCTTG AAAACTTGCC AGTTGTTTCT GTTAGCCTGT 
CTGAACACGG TCACACGCCT CACTATGTTG CCAATGGAAG AATCCACTTG GTTGCAAACC 
TGGTTGAAAT GTCTATGAAA AACAGACAGG CCTTAAAGGT CATGAACAAG TCATCGGTGG 
TGGAACCTTT GGTCGCTTGT TAGAGCGCGG AGTTGCCTAT GGTGCTATGT TCCCAGACTC 540 
AATTGATACC ATGCACCAAG CCAATGAATT TATTGCCTTG GATGATCTCT TCCGAGCAGC <™ 
AGCAATTTAT GCCGAAGCTA TTTACGAATT GATCAAATAA AACGATAGAA GTCTGAGATC 
TTATGCTTGG ACTTCTTTTT GGAGGGAAAG TAGATGTCTC AAATCGAAAG AATCAAACAG 
GCTATCATGG CGGATTCACA GAATGCCAG 

(2) INFORMATION FOR SEQ ID NO: 114: 



300 
360 
420 
480 



600 
660 
720 
749 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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Cii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 



CTGGATGAAT TGCAGAAGCG CAACCTACTG GGATTTGTCT TCCAAGATTT TCAACTATTT 60 

CCTCATCTAT CAGTTCTGGA AAATTTGACT TTATCGCCTG TGAAGACCAT GGGAATGAAG 120 

CAGGAAGAGG CTGAGAAGAA GGCGAGTGGA CTCTTGGAAC AGTTAGGACT AGGAGGACAC 180 

GCAGAGTCCT ATCCTTTCTC ACTATCTGGT GGGCAAAAGC AGCGGGTGGC TTTGGCGCGT 240 

GCTATGATGA TTGACCCAGA AATCATTGGC TACGATGAAC CAACTTCTGC CCTGGATCCA 300 

GAATTACGTT TGGAAGTGGA GAAGCTAATC TTGCAAAATA GGGAACTTGG GATGACCCAC 360 

ATTGTGGTTA CCCATGATTT GCAGTTTGGC TGAAAATATC GCACATGTTA TTATTGAAAG 420 

TAGAACCTCA AATAGGAAGA AAAATGGATT GAAAAAATGG ATGCTTGTAT TAGTCAGTCT 480 

GAAGACTGCC TTTGTTCTTA GTA 5Q3 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 



CTTCCTCATG GATAG 



60 



CTGGAGGGCA TTCAGTATTA CTTGAATAGA GGAAAACATC ATCTCCCTTA TGTTCTCAAT 

ATTGGATTTC CTGGTCAGAA AAATGACCTC TTACTCCTTC GGCTAGATTT AGCTGGAATT 120 

TCAATCTCTA CTGGCTCAGC CTGTACTGCA GGCGTTGTCC AATCCAGCCA TGTTCTTAAA 180 

GCCATGTATG GCGCAAATTC AGAACGCTTG AAGGAATCCC TTCGCATCAG TTTGTCGCCA 240 

CAAAATACCG TTGAAGACCT ACAAACCCTC GCAAAAACCT TAAAAGAAAT TATCGGAGGT 300 

TAGCCATATG GCATTTGAAA AAATCATTCA GTTAAAAAAT TGTCGTTACG ATTACACTCT 360 

TAGCCCTTCT GTTAAAAAAT TCACCCTCAA AG ATAAC AC C TTTTTTGAAA CTAAGGTTGG 420 

TAACTATGAA CTGACTCGCC TTTTGGAAAA AGTGCCAAAC AGCGGTGAAG GCTTCCAACT 480 
CAAAATCATC ATTAACAAGG AACTTACAGG GGCTAAAATC AATATCACTG ACAAGTTTGG 



540 



CCTTCGTCTA GTTGATATTT TCAAATCAGA AGACCACCAT ATTCATCAGG AAAAATTCTA 600 



615 



(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 954 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 
<D) TOPOLOGY : linear 



PCTYUS97/05306 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:116: 



60 
120 
180 
240 



CTGCAATGGT TAGCGAGGAT GGTGAAATAG AGGAGCCTTA ACTATTTTGA GGAATCCAAG 
CTGACAGGAC TGGTATGTCT GCTCTTTGGC TCTGAAGGCA GACTATACCA GCTGGAAGTT 
TGACAGATGT GGCACGTGAG AATTATTGGA GGCNTTGACN TANTCAGAGG ATTCTCTCTT 
GATATTGGGT TTGGCAGAGT TGGATAGTGA GTTGGAAAAT TACCAAGCGG TTATTCAAGC 
CTATGCCCAG TTAGATAATC GCTCGATTTA TGAGCAAACG GGCATTTCCA CCTATCAACG 300 
AATTGGCTTT GCCTATGCTC AGTTAGGGAA ATTTGAAACG GCTACTGAGT TTTTAGAAAA 360 
AGCCCTGGAG TTAGAATACG ATGACTTAAC AGCTTTTGAG TTGGCCAGTC TTTATTTTGA 
TCAAGAAGAA TATCAAAAAG CCACCCTCTA CTTTAAGCAG CTTGATACCA TTTCTCCTGA 
CTTTGAAGGC TATGAGTATG GGTACAGTCA GGCTTTACAT AAGGAACATC AAGTTCAAGA 
AGCCCTGCGT ATCGCTAAGC AAGGATTAGA GAAAAATCCC TTTGAAACTC GCCTCTTGCT 
AGCTGCTTCA CAATTTTCTT ATGAATTGCA TGATGCTAGT GGTGCAGAAA ATTATCTCCT 
TACTGCAAAA GAAGACGCTG AGGATACAGA AGAAATCTTG CTTCGTTTAG CCACTATTTA 
TCTGGAGCAG GAGCGTTATG AGGATATTCT AGACTTGCAG AGTGAGGAGC CAGAAAATCT 
TTTGACCAAG TGGATGATTG CTCGTTCTTA TCAAGAAATG GACGATTTGG ATACTGCTTA 
TGAGCATTAT CAAGAGTTGA CAGGAGATTT GAAGGACAAT CCAGAATTTC TGGAACACTA 
TATCTATCTC TTGCGTTGAA TTGGGACATT TTGAAGAAGC AAAATCCCAT GCTC 

(2) INFORMATION FOR SEQ ID NO: 117: 



420 
480 
540 
600 
660 
720 
780 
840 
900 
954 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

CTATGAAACA TTTTGATACT ATTGTCATCG GTGGGGGACC TGCTGGTATG ATGGCTACGA 
TTTCCAGTAG CTTTTATGGA CAGAAAACCC TCCTCATCGA AAAAAATCGG AAACTTGGAA 
AAAAATTAGC TGGGACTGGT GGGGGACGTT GCAATGTGAC CAACAATGGT AGCTTAGACA 
ACCTGCTAGC TGGAATTCCT GGAAACGGAC GCTTTCTTTA CAGTGTTTTC TCCCAGTTCG 
ATAATCATGA CATCATCAAC TTTTTTACAG AAAATGGTGT TAAACTTAAG GTCGAAGACC 
acggIcgcgt CTTTCCAGCC AGTCACAAGT CTCGGACTAT TATCGAAGCT TTGGAAAAGA 
AAATCACTGA ACTAGGTGGT CAAGTTGCTA CTCCAAATAG AAATCGTTTC TGTTAAAAAA 
GTAGATGACC AGTTTGTCCT TAAGTCAGCG GATCAAACCT TCACTTGTGA GAAACTCATT 
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GTCACAACAG GTGGTAAGTC TTATCCTTCG ACTGGTTCGA CTGGTTTTGG TCACAATGCT 
CCGCCATTTA AGCATACCAT CACG 



540 
564 



(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 651 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 



CTCCGTGGTC 


TTTTGGTTTG 


CTACGGGATG 


TAGGCAAAAG 


QCAAGTCCAT 


TTTATCAGCA 


60 


ATAATGGCTC 


CGTGTGGAAT 


CCCTGCTGTT 


GCAGTTCCTG 


CAATCACTTC 


TACTTCAGGA 


120 


AAGGCTTCTT 


TGATAGCTTC 


CACAAAACCA 


TTTTCAATTA 


GGGTACGAGT 


TTCTGGATAG 


180 


GCTAGTGTCA 


CACGATTATC 


AGTGTAAATC 


GGTGACTTGA 


TACCAGATGC 


CCAAGTGAAG 


240 


GGTTCCTCTG 


GTTTGAGGTA 


AACGGCTTGG 


ATTTTCAAGA 


GGTGGCTAGC 


GATATCTTTA 


300 


GCAAGTGTCA 


TGGTATTCTC 


CTTTTATTTT 


TCTAATCTAG 


TTCTTTAATT 


CCAGTCCTGT 


360 


GTCCATTCAT 


CCTTGATGGC 


ATGATAAGCT 


GCAACAGGAT 


CCTCAGCTTG 


GGTAATGGGA 


420 


CGTCCCACTA 


CGATATAGTC 


ACTGCCGATT 


TGATAGGCAT 


CAGCAGGTGT 


CATGACTCGT 


480 


TTTTGATCTC 


CAACTGCAAC 


ACCAGCTGGA 


CCAATCCCTG 


GTGTCCCGAC 


AGATAAAAAT 


540 


CTGGATTGGT 


AGCCCTGCTT 


GATGACTTTG 


TCACTTCCCT 


GAGCCGAGCC 


AAACCAACAC 


600 


CCATCCCAAA 


GCCCAAGCTT 


CAGCTGTCTT 


CCTTGGCATA 


GTTGAATCCA 


C 


651 



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 602 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 



CTTAGTATCT 


AGAAAAGGAG 


AAATAAAATG 


GTTAAAGTAT 


TAGCAGCGTG 


CGGAAATGGA 


60 


ATGGGTTCAT 


CAATGGTTAT 


CAAGATGAAG 


GTTGAAAATG 


CTCTCCGTAA 


GCTTAATCAA 


120 


ACAGATTTTA 


CAGTCAATTC 


ATGCAGTCTC 


GGTGAAGCTA 


AAGGTTTAGC 


AGTAGGATAT 


180 


GACATCGTAA 


TCGCTTCTCT 


TCATTTGATT 


CAAGAATTGG 


AAGGGCGAAC 


TAATGGGAAG 


240 


TTAATTGGGC 


TTGATAACTT 


GATGGATGAT 


AAAGAAATCA 


CCGAAAAACT 


CAGTCAAGCA 


300 
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ATACAGTAAA AGGTTGGAGG GGGCTGGACA CAAACTGAGA GTTATCGTTT CTGTCCTTCT 360 

CCCTCTTTAA ATAAAGGAGG CAGATATGAA TTTAAAACAA GCTTTAATTG ACAACGACTC 420 

GATCCGACTA GGTTTAGAAG CTAACGAATG GAAAGAAGCA GTCAAGGTAG CAGTAGATCC 480 

CTTGATTGAA AGTGGGGCAA TTTTGCCAGA GTATTACGAT GCTATCATTG AATCGACTGA 540 

AGAGTATGGG CCTTACTATA TCTTGATGCC AGGTATGGCT ATGCCCCACG CTAGACCTGA 600 

AG 602 

(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 



ATTGGGAATT 


TTTGCAACTG 


CAATTGGTGC 


CCTCAGTAAT 


CTATAAAATA 


GATTCAAGAA 


60 


AATTTAGTGA 


CTGGGATTTC 


CCAGCCCTTT 


TTTAAAGTGA 


GAAGAAATAA 


TGAGTATGTT 


120 


TTTAGATACA 


GCTAAGATTA 


AGGTCAAGGC 


TGGTAATGGT 


GGCGATGGTA 


TGGTTGCCTT 


180 


TCGTCGTGAA 


AAATATGTCC 


CTAATGGAGG 


CCCTTGGGGT 


GGTGATGGTG 


GTCGTGGAGG 


240 


CAATGTGGTC 


TTCGTTGTAG 


ACGAAGGACT 


ACGTACCTTG 


ATGGATTTCC 


GCTACAATCG 


300 


TCATTTCAAG 


GCTGATTCTG 


GTGAAAAAGG 


GATGACCAAA 


GGGATGCATG 


GTCGTGGTGC 


360 


TGAGGACCTT 


AGAGTTCGAG 


TACCACAAGG 


TACGACTGTT 


CGTGATGCGG 


AGACTGGCAA 


420 


GGTTTTAACA 


GATTTGATTG 


AACATGGGCA 


AGAATTTATC 


GTTGCCCACG 


GTGGTCCGTG 


480 


GTGGACGTGG 


AAATATTCGT 


TTCGCGACAC 


CAAAAAATCC 


TGCACCGGAA 


ATCTCTGAAA 


540 


ATGGAGAACC 


AGGTTCAGGA 


ACGTGAGTTT 


ACAATTGGAA 


CTTAAA 




586 



(2) INFORMATION FOR SEQ ID NO: 121 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEIK3TH: 680 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

TTCGACAATC TTGCCCNNGA TCCAAGACAA CCACTTCTCT GTCCGCTCAG CAATAGTCAA 
GCGGTGAGCA A1GAAAATCA AAGCTCrTCT CCAAAGCCAT GAGATAATGG ACAATCCGTT 120 

i?0 
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TTTTTGTCAA AATATCCAAA CTGTTAGTCG CCTCATCCAA AATCAAGACC GGCGCATCTG 180 

TCAAGAGAGC ACGCGCCAAA GCGATTCTCT GACGTTGACC ACCTGAAATC CCTGCCCCAT 240 

CCGAAGTCAA TTCTGTCTGG TAAATTTCAA GGTGGCATGG CGGCTTCGAG ATATTCCTCT 300 

TCGGAATCTC TGAACCAAAT TCGGACCGGC CCGTTAAAGA TATCTTCCCT GTGTCGTCCC 360 

CTCCCTTGGC TCCCAAAAGA AGATTCTCCA AAATCGTTCC GTTAAAGACA TAGGGCTGTG 420 

GAGGCAGATA GATGATGTAC TGGCGTAGGG CTTTTTTATC AATCTGATTG AGATTGACAC 480 

CACCCAGACT AATCTCCCCT TGACTTGGGT CGTAAAAATT AACCATCATC TTGGCCAAAG 540 

TCGTCTTACC TGACCCTGAA ATCCCCACAA AAGCCACCTT AGACCCTTGG GGAACGGTTA 600 

AATTGATATC CGACAAGACG TCTCGACCAT AGCCATACTT GTAATGAACC TGCTTGAAAG 660 

TCATCTCTCC CTTCATCAAG 680 

(2) INFORMATION FOR SEQ ID NO: 122: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 



CCAATTTGGA 


TTTGCGAGAA 


GGCGTTATTA 


TTTCAGATGC 


TGGTTCGACC 


AAGTCAACCA 


60 


TTGTGGATGC 


GGCGGAGCAG 


TATTTGGCTG 


GCAAGTCTGT 


TCGCTTTGTC 


GGGGCCCATC 


120 


CCATGGCTGG 


TAGTCACAAG 


ACAGGGGCTG 


CTTCGGCAGA 


TGTCAATCTT 


TTTGAAAATG 


180 


CCTATTATAT 


CTTTACAACC 


TTCCAAAGCC 


CTGAACAAGT 


CAAGGACAAC 


GCTTAAAGGA 


240 


AATGGGAAAG 


GATCTGCTTT 


CCAGGTCTTC 


ATGGCTCGTT 


TTATCGAGAT 


TGATGCCAAG 


300 


GAGCATGATC 


GTGTCACTTC 


TCAGATTAGC 


CATTTTCCTC 


ATATTTTGGC 


TTCTAGTCTC 


360 


ATGGAGCAGA 


CTGCGGTCTA 


TGCTCAAGAG 


CATGAGAATG 


GCAAGGCGCT 


TTGCGGCAGG 


420 


TGGTTTTCGA 


GATATGACCC 


GAATTGCGGA 


AAGCGAGCCA 


GGAATGTGGA 


CCTCCATTCT 


480 


CTTGTCCAAT 


AGCGAGACCA 


TTCTGGATAG 


AATTGAGGAT 


TTCAAGGAAC 


GTTTGGAAGC 


540 


GATTGGTCAG 


GCCATTAGTA 


AGGGAGATGA 


AGAGCAAATT 


TGGAACTTTT 


TTAACCAAGC 


600 


G 












601 



(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

CTGGCTCATC AGGGGCAGGA TATTTATGAT TTCCCTCGTG CTATGATTAA GGAAGATAAT 60 

CTGGAGTTTT CATTCTCTGG TTTGAAATCT GCCTTTATCA ATCTTCACCA CAATGCCGAG 120 

CAAAAGGGAG AAAGTCTGTC TACAGAGGAT TTGTGTGCTT CCTTCCAAGC AGCTGTACTG 180 

GATATTCTCA TGGCAAAAAC CAAGAAGGCT TTGGAGAAAT ATCCTGTTAA AACCCTGGTT 240 

GTGGCAGGTG GTGTGGCAGC CAATAAAGGT CTCAGAGAAC GCCTAGCAGC CGAGGTTACA 300 

GATGTCAAGG TCATCATTCC ACCTCTGCGC CTCTGCGGAG ACAATGCAGG TATGATTGCT 360 

TATGCCAGTG TCAGCGAGTG GAACAAAGAA AACTTTGCAA ACTTGGACCT CAATGCCAAA 420 

CCAAGCCTCG CTTTTGATAC CATGGAATAA AGAGTGGCTC TTTGTCAAGT GTAGTGGGTA 480 
GGCGAAAAGC TACAATCTGG AGANTACGAA ATTC 



514 



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:124: 

CTTGGATTAT AAAATACAAT CGGCAAGATC ATAAGTCCCA AGCCTAAAAA TATATAGAAA 
GGGGGTCACC TTCCAAAGAA ATTCTGTATT AAAGAGCATG ACCACAAAAC CAATCACAAG 
CCCCAAGGCA ATCCAGGCGA CCTGCTGCCC TAAAATGGGC AGAATATTAT TGGGGTAATC 
ATGACTAACG GCTATATAGA TAGCCACCAC ACCGATGACC AGTAGAAAAA ATACTGGCAA 
GAGCAAACTG TAATCGACTC TTGAGTCGAG AGAACGTTTC ATATAAACTA ACCTTATACT 
TTCATACAAT ACTATTTATC AAAGTTCATT AAAAAATCTA TCAATAGCCT CGTCAACTTC 
GGATCGAGAG ATGGTTTTAA CAGTCGCTTC TTCTGCTAGA GATGCTACTA TTTGTTTGCC 
GTATCGTTTT CCGACGATTC TCCTATCCAA AATAAGAGTT AAGGAACGTT GGTATTCACG 
TCTCATACTT CTTCCCAAAG CCTGTTTTAA ACGAATAATG G 

(2) INFORMATION FOR SEQ ID NO: 125: 



60 
120 
180 
240 
300 
360 
420 
480 
521 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 base pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DIJA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 



CTTGCCTGAT GACATCATTG ACTCTTTTTG GTATATCATC GACCATTTCT TAAAAAATGT 60 

CTTTGAATTG GAAGAAGAAC TCGAGTTTCA ATTGCTTAAT AACCAAGGAA AGATTACCTT 120 

CCACTTTTCA AGTCAACACC TCCCTACAGC CATTGATTTT GACTTTAACC ATCCTTTCGA 180 

CCCTCGTTAT CCCCCAAGAG TACTGGTTTT AGACATGGAC GGTAGAGAAA CTATCCTCCT 240 

CCCAGAAGAA AATGACCTAT TTTAAAAACT CTAGCCTTCA GTTGCAAGTG ACTGAAAACT 300 

AGAGTTTTTC TATTTTTTCA AAGCATCATA CAAGTTGCGG ATCGGTTGTT TTAATATCGG 360 

ATGGATAAAA TGAGGCGCAA TTTCCTGTAA GGACTCAAGG ACAAAAAGGC GTTCCCGCTA 420 

TGTTAAGGGA TGAGGCAATA TGAGGTCGTC TGTATAAAGG ATCTGGTCCT CCCACAAAGA 480 

GCAAGTCCCA AATCAATCCA AACGAAGTCC CCCAATGCAC TTCTCTCACC CGTCCCCAGC 540 

TCTTGACTCC AATGGCTAAC AAGGTTTCNT AACAATTCTT GTTGCTGGTA GCCAAGTTC 599 



(2) INFORMATION FOR SEQ ID NO: 12 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 



CTATGGGTAA 


AAAGGATGCC 


TCTGCTATGC 


ATGAGATGAG 


GGCTTCCTTT 


ATTCAAGGCT 


60 


CCATAGAAGC 


AGGCCATACT 


GCGGAAAAAT 


CAGAGCAGGT 


TTTTGATGTT 


ATGGAGAAGT 


120 


TTGCAGGTTA 


TGGATTTAAT 


AGATCTCATG 


CCTACGCCTA 


CTCAGCCTTG 


GCCTTCCAGT 


180 


TGGCTTATTT 


CAAAACGCAT 


TATCCAGCCA 


TTTTTTATCA 


GGTCATGTTA 


AATTATTCCA 


240 


ACAGTGATTA 


CTTAATAGAT 


GCACTTGAAG 


CAGGTTTTGA 


AGTAGCCTCT 


CTATCCATCA 


300 


ATACTATTCC 


CTATCACGAT 


AAAATTGCCA 


ACAAGTCTAT 


CTATATAGGT 


TTGAAATCGA 


360 


TTAAGGGGCT 


CCAGCAAGGA 


CTTGGCGCTT 


TGGATTATTG 


AACATAGACC 


TTATTCTAAC 


420 


ATTGAAGATT 


TTATAGCTAA 


ATTACCTGAG 


AATTATCTGA 


AACTTCCTCC 


TGCTAGAACC 


480 


TTTGGTAAAA 


GTTGGTCTTT 


TCGATTCATT 


TGAAAAAAAT 


CGTCCAAAAG 


TATTTAATAA 


540 


CTTAGCTATC 


TATTGAATTG 


TGAA 








564 



(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 618 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 127: 



ACTAATCACT 


TACTACGTAG 


CTTCTGACTA 


TCTACCTCTA 


CTAGATAAAG 


GACAAACTGT 


60 


AAGATTAAAA 


CTGGAGAAGN 


TTGGAAATCA 


CGGCATTACC 


ACCATCGGCC 


AACTTCAGAC 


120 


AATTGATCAA 


ACTCCTACCA 


GAACAGAGCA 


AGGCAATCTC 


TTTAAATTAA 


CCGCTCTTGC 


180 


AAAACTATCT 


AATGAGGATA 


GTAAACTCAT 


CCAATATGGC 


TTACAAGGTC 


GCGTCACTAG 


240 


TGTAACTGCA 


AAGAAAACAT 


ATTTTGATTA 


TTTCAAAGAT 


AAAATTTTAA 


CCCATTCTGA 


300 


TTAATTTTCA 


GATAACACTC 


TATAACTATT 


TATTATCTTA 


TCAAAAAGGA 


GAATCATAAC 


360 


ATGGATAAGA 


AACAAAACCT 


AACTTCATTT 


CAAGAACTAA 


CAACTACCGA 


ACTCAACCAA 


420 


ATTACAGGTG 


GAGAATGGTG 


GGAAGAACTC 


TTACATGAAA 


CAATTTTAAG 


TAAATTTAAA 


460 


ATCACAAAAG 


CACTTGAACT 


ACCTATTCAG 


CTATAAAAAC 


AAGACCGAGA 


AACAAGAACT 


540 


CTCGGTCTTG 


TTTTTTATCA 


TTCTGCATGT 


ATCACAGTAA 


GTACCTGACG 


AAAGACTTGA 


600 


TTTTGGCGAG 


GTAGTATT 










618 



(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 



CAGGGGATGC 


CAGTTGATTT 


GCTTCAAGTT 


GACTTGACTC 


GTACTTGGGA 


AATCCTCGGA 


60 


GAAATCACTG 


GGGATGCTGC 


TCCAGATGAA 


CTCATCACCC 


AACTCTTTAG 


CCAATTCTGT 


120 


TTAGGAAAAT 


AAGAAAAATC 


CATGATCCTT 


CATTCGGTCA 


TGGATTTTAT 


TGTCTTTATT 


180 


AGTAATCTGG 


TCTTAAGACC 


CCTGTTACAG 


TTGCCTTAGT 


TGCTTCGTAG 


TCGCCATCTA 


240 


CGACAACCTT 


GATAATGCGT 


TTGACATCTT 


CTTCTGGTGC 


TGGAACAAGA 


GGTAGACGAG 


300 


TGGGTCCAGC 


TTCAAATCCC 


ATATAGTTAA 


GAATTGCCTT 


AACTGGAGCA 


GGACTTGGAT 


360 


AAGAGAAGAG 


AGCATTAACC 


TTAGGAATGA 


ATTTACGCTG 


AATTGCTGCG 


GCTTTCTTCA 


420 


TATCGCTTTC 


TGCAATGGCA 


GTAAACATCT 


CGTGCATTTC 


ATCCCCATTT 


GTATGAGAAG 


480 


CAACAGAAAT 


AACCCATCCG 


CCCCAAGGTT 


CATGGCATGG 


AAAGCATCTC 


CATCCTCACC 


540 


TGTATAAAT 












549 



(2) INFORMATION FOR SEQ ID NO: 129: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 45 5 base pairs 
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<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

CTGGCTANCA AAATCATACT CATAATCCTT AACACCGACA GAGATAGAAG CAATGAGCCC 
TTGATCGTGC ATTCGTTTAA TAAAAGGAAT GCGTCCTGCC TCATCAAAAC GGTGCATAAG 
TGTACAAGTA ACCACCTTTA GCCAGTTGCT CTGCTACATT TTCATCCAAA ATCGTCTGCA 
TATTCGCTGG CACAACAGGT AGTTTAAAGG TGTGATTTCC TAAAGTGACA CTTGTATCCG 
CTTCTGCACG GCTTTTAATG ACACATTTAT TTGGAATCAA TTGAATATCT TCGTAATCAA 
AAATTGGAAA TTCATTTAAC ATATCGATGT CTCGTTTCTT TTGTAATGAC CTACCTATGC 
TCTCGCATCA CTACGCCTTT TCCGACGTTT CCCTTAAATT TATTATAAAC CAAAAGTACA 
GTTTTTGTCA AATTATTTCA TGAATNAAAT ATATC 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

CTCCTGGGTT TTGATTAACT CTAGGGTTGC CAAAAAGAGG GTGATGACTT CTTGGACATT 
CTGGGCTTCC TTGAACAAAT CCTGCAAGCG CAATTGATTT CGTCCAATCA AGGATTCTTT 
CACAATAATC ATCATGTCCT CAATCTTATA CTCATCCCGC AAGATAGTCG TGTGATTTTG 
TGCAAACTCC TCTTTTTTCT TGGCTAGGAT ATTTGAAAAA GCCAAAAAGA GGTCAATGGT 

CGTCTTGTCA TGCACAAGCT CCGCATCTTC GTAAATCAAC TCTATCGGCG CTTTGGAATA JWU 

ATACTGGGCC CGTTTCTTGG TGCTTGGCTT CCAAGTGCTC ACCCAAGAGC TTGAACTTGC 360 

GATATTCTTC GATTTGAGAG AGGAGGTCCT GCTCCAGGTC ATCCCCCAAG TCTGTCACTT 420 

CTGCTACCTT CGGAAGGAGT TTACGACTCT TAATCAGCAT GAGCTGACTA GCCATGACCA 480 

TGTACTCACC CGTCACTTCC AGACGCATGG CCTGCAGGGT TGAGACATAG GCTAGATACT 540 

GTTCGATGAC TTCCGTAATG GGCACATCGT AGATATCCAT CTGGTACTTA GAAACCAGAT 600 

GCAAGAGTAA GTCCAGGGGT CCTTCAAAAT CTTTTAATTT AATATCCATT ATCTATATTT 660 

TTCTAAGGTC AGGACTGTTT TTAATCCTAA TTTTTTTGCA ATTTCGTACA AATCGACCTT 720 

GTTTTCTATT TGTCCTTAGA ATAAACTGTT CACGTAAGAC TTGAGATCGA ATTCCT 776 

(2) INFORMATION FOR SEQ ID NO: 131: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

ATTGATACCN ATTGTACAGG ACAACTTATT TTGGAGCGCC ATAGCAGGGA TANGGTTAGC 60 

AATAGATGCG NTGACCATAG AAACGTTTNT TAGCCAATAC ATAACCTGCT AGAGATGAGG 120 

TTGCACAAAC TAAGAACATG GTTACCAATG AGATAAATAC TGAGTTCCAC ATCCATTGCA 180 

AGGCAGGGTT CTGCACCATG AGTTGTTGGA AGTTTTCCAT GGTTGGCATT TTAGGGAACC 240 

ACTGAGGAGG AATAACAATT GTATCAGGTT GTGATTTGAA TGCCCCTGTC AAAATCCAGT 3.00 

AGAATGGAAA GATGNACAGC ACAGTCAACA AGAGCAAAAT GATTGTTGAA ATNACAGTAA 360 
AGGCTGTTAA TGGTTTTTTT TCTGTAGATT GCATAGCTGT CTCCTTTCTT TAGTATTCTA 
CGTCGTTTCC AAGTACTTTA AATTGAACAA AGCTTACGAT AGCAATCATG ACTGCCAAGA 

AGACACCAAT TGTGTTGGCA TAGCCGTATT CTGTCAATTG GAAGGCTTTT TCGTAAAGGT 540 

AGTACATCAA GGTACTTGTT GAGTAGTTTG GACCACCAGA TGTCAAAAGC TGAATCAAGG 600 

CGAAACACTG GAATGAGTTA ATTGTTGTGA TGATTGCAAT ATAAAGAGTT GTTGGAAGAA 660 
GGCTTGGCCA TTTAATCTTC CAAAAAACTT GAAACTCAGT TGCACCATCA ACACG 

(2) INFORMATION FOR SEQ ID NO:132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:132: 



CTGGGCTGAG TTTTCCAGAT ACAAATACTG CAACCGCAAC TGCAATCCCC CAACCCATAG 
TAATCACAAT CCAACCTGCA CTGTTGCTCT TGGTTTTTGG AAGAACCACA CCTGCAACAA 
CACCATTTCC TAGAAGAATC AGGATTAAA3 TCCCTAAAAA TTCTCCAAAT AATTCACTCA 
TCATTTTTCT GTCTCCATTA AAAAGAAGGG GCGGGCGACA AGGATTGCTA CCCTCCACCT 
CTTTTATTTT TTCTTAATTT TTTAATTCTG CTAAGTCGTT TTGAGCAAGA GCTGCTTTTA 
CATCAGCACG GTAAGTTGCT TTTTCTTCTT CTGTCCAGTC ATAGAATCGT CCCATTTCAT 
CCAAAACTGG CTCAACGATA CTATCCAAGC TATCACGCAT AAAGAGCATG TGATTGGTAC 
GACGAAGAAG GAAGTCAACT GGGCTAAGAG TCAACTCATT GCGCATTGCA TAGTGAAGGG 
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ACAAG 485 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 670 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 



CTTAATTATA 


ACTTAAACAC 


AAAAGTTTTA 


CACAAACTGT 


GGATAACTCT 


TTTGAAACTG 


60 


TGATTTTCTT 


AATTGAAATC 


TATTTTTTAT 


TTTGTGAATA 


AGATGTGAAA 


AAATAGAGAA 


- 120 


TATGTTAGAA 


TAGAGTCATG 


AAAATTAAAG 


TTGTAACAGT 


TGGGAAACTG 


AAAGAAAAGT 


180 


ATTTAAAAGA 


TGGTATCGCA 


GAGTATTCAA 


AACGAATTTC 


TAGATTTGCT 


AAGTTTGAAA 


240 


TGATTGAGTT 


ATCAGATGAA 


AAAACACCAG 


ATAAGGCCAG 


TGAATCAGAA 


AATCAAAAGA 


300 


TTTTAGAAAT 


AGAAGGTCAG 


AGAATTTTAT 


CAAAAATTGC 


TGACCGTGAT 


TTCGTTATTG 


360 


TGTTAGCCAT 


TGAAGGGAAA 


ACTTTCTTCT 


CAGAAGAATT 


TAGTAAGCAG 


TTAGAAGAAA 


420 


CTTCTATAAA 


AGGANTTTCT 


ACTCTTACTT 


TTATTATTGG 


GGGAAGTTTA 


GGATTGTCAT 


480 


CATCTGTAAA 


AAATAGAGCC 


AATCTTTCTG 


TCAGTTTTGG 


TCGCCTAACC 


TTGCCTCATC 


540 


AGTTAATGAG 


ACTAGTTCTT 


GTTGAACAAA 


TCTATCGCGC 


TTTTACGATT 


CAGCAGGGAT 


600 


TCCCCTACCA 


TAAATAGAGA 


ATTGACTTTT 


AATTGAATTT 


TTGGTAGAAT 


AATTGTGTTA 


660 


GGTCTCATAG 












670 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 



CTATAATTTT 


ATTAGCTGTA 


TTCCAGTCGT 


CACGACCAAA 


CTCTGTTACA 


GGGACACGAA 


60 


TGTCAAAACG 


GTTCTCAATC 


TCCACAATCA 


ACTCAACCGT 


TCCCATACTA 


TCCAAGACAC 


120 


CTGCATCAAA 


AAGATCTTCA 


TCCATCATGT 


CAGAAACATC 


TTCCATAAAC 


AACTCATCAA 


180 


TAATTTCGAT 


AACTTCTGAT 


TTGATATCCA 


TATTTTATTT 


CCTTTTATTT 


TTTAAACCAT 


240 


AGATTATTCA AGAATCCAGA 


AAAGATTAAG 


AATGACAACA 


TGACAACATG 


GAAAGTGACA 


300 



137 



WO 97/37026 PCT/US97/05306 

ACCATGCCAA GCAACTGAAT CCAGCGATTC TCAGGTAGGG CAGCCTTCCC TGCTTTTTTC 360 

CGTTCCTTAT TGAGCGTTTT TTTCTTGCGA ACCCAGGCAT CATTGATGAA CAAGCCTAAG 420 

TCCATGAAAG AGTCCCATAG GCGATATAGT ACCAGGTCAC ACCATGCCAA AATCCCCATA 480 

ATCCAGCATA TTTACAATGT AAGATC 506 

(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 921 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:135: 



ATGGTTCGAC 


CCACAAATAT 


GTTTTGCGAA 


TCATTGACCA 


TTCTATGATT 


TACGTTGCCA 


60 


TTGCCGGCTC 


ATACACGCCC 


GTTGTCTTGA 


CCTTGATGAA 


TAACTGGTTT 


GGCTATCTGA 


120 


TTATTGTCAT 


CCAATGGGGA 


ACGACCATCT 


TTGGTATTCT 


CTATAAAATC 


TTTGCTAAAA 


180 


AGGTCAATGA GAAATTTAGC 


CTTGCTCTTT 


ACCTGATTAT 


GGGCTGGTTG 


GTTCTGGCTA 


240 


TCATTCCTGC 


CATTATCAGT 


CAAACNACAC 


CCGTTTTCTG 


GATTCTCATG 


GTAACTGGCG 


300 


GACTCTGTTA 


TACAGTTGGA 


GCTGATTGAT 


AGCCTTTATG 


CATTGAAATT 


TGACAGTTTG 


360 


ACTTATGAAA 


ATAAAGCAGA 


GGTTATGGAC 


TTTATCAAGG 


CTCGTGTTGA 


TAAGATGATG 


420 


GGCTCTACTC 


CAAAAGATAT 


CAAGGGAAGC 


AGTTCTTGCA 


GGTTCAAACT 


TTGTTGTGGC 


480 


AGATATGTTT GGAAGCAGCA AGTGCTCTCG 


TAGAAGTAAG 


CAAGGAAGAA 


GATTTTAAAC 


540 


CATCTGTTGA 


ATCACTTTCT 


CGTGCCTTTA 


ACCTGGCTGA 


GAAGGCAGAA 


GGGGTTGCTA 


600 


CAGTTGATTC 


AGCACTATTT 


GAGAATGACC 


AAGAAAAAGC 


TTTGGCAGAA 


GCAGTAGAAA 


660 


CACTCGTTTT 


ATCAGGACCT 


GCAAGTCAGC 


AATTGAAACA 


ACTCTTTGCG 


CTTAGCCCAG 


720 


TCATTGATGC 


TTTCTTTGAA 


AATACTATGG 


TAATGGCTGA 


AGATCAGGCT 


GTCCGTCAAA 


780 


ATCGTTTGGC 


AATCTTGTCA 


CAACTAACCA 


AGAAAGCAGC 


TAAGTTTGCT 


TGTTTTAACC 


840 


AAATTAACAC 


TAAATAAAAT 


TTGATAAACG 


GACTTTATCT 


TATTACAAAG 


GAGAAGAAAT 


900 


GGATCCGAAN 


ANATTGCTCG 


T 








921 



(2) INFORMATION FOR SEQ ID NO: 136: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: 

CTTGGCAACA GGCTCTTTTT GTATTGCGTT 
CATTTAACTA AAAATTATTG TATGTTATCT 
AGAAAGATTT GTGAATACAA ACTATTTCTG 
TGGGAGATGG GATGAAAGAA TTTCAATTTG 
ATGCAATAGG AGCTTGTTCG GTCTTGCTAG 
CTCAGCCTGT ACAGGATACA GAAACGAGTT 
AGCAGGATTT ATCTGAAAAG CTGAAATCTG 
TTTTGAACTT ATGGGAACAT TAGGTTACTA 
ATACAAAAAC CAAGCTGAAG AAGAAGGCAA 
ATCTTAAGAA AAACATATGC ACCTTCAAAA 



PCT/US97/05306 

SEQ ID NO: 136: 

TATTAAGAAA AACAAAAAAC ATCGATATAA 60 

CCCTTAATTA GGAATGATAA GGGAATAACT 120 

ATATACAAAA TATACAGTAA TAATGAATGA 180 

AGAGAAAGCA GCGTTTTTCT TTGAGGCCAT 240 

GAACGAGTTT ATTTTTTGCT GGTATGGGTG 300 

CAGCACTAAT TTCAAGTCAT TATTTGGATG 360 

AGTTGCAATG GTTTGAATTA GAAAACAAGC 420 

TGAAGGATTT GTTCCTTATG TTTCAAATCA 480 

ACCGCTATCT GATAAATATA TTTTCGAAAA 540 

A 571 



(2) INFORMATION FOR SEQ ID NO: 137: 



<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 611 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 



GTATCAACTC CAAAGAGCAA GTGACTGTCA AAGTTGTAAC AGATGCGGCC AAGAAGCTCA 60 

TGGGTGATAA GATTGCTCGC CAGAAAGAAC GTGGGATTCA GATTGANACC TTGCGNACCA 120 

TGATTATNGG GATTCCANAC GNTGGTAAAT CCACTCTGAT GAACCGTTTG GCTGGTAAAA 180 

AGATTGCTGT TGTTGGAAAC AAGCCAGGGG TCACAAAAGG TCAACAATGG CTTAAAACCA 240 

ATAAAGATCT GGAAATACTT GGATACACCG GGGATTCTCT GGCCTATAGT TTGAGGATGA 300 

AACTGTTGCA CTTAAGTTGG CATTGACTGG AGCTATCAAG GATCAGTTGC TTCCTATGGA 3 60 

TGAGGTTACC ATTTTTGGTA TCAATTATTT CAAAGAACAT TATCCAGAAA AGCTGGCTGA 420 

ACGCTTCAAA CAAATGAAAA TTGAAGAAGA AGCGCCTGTG ATTATTATGG ATATGACCCG 480 

CGCCCTCGGT TTCCGTGATG ACTATGACCG TTTTTACAGT CTCTTCCGTG AAGGAAGTCC 540 

GTGATGGCAG ACTCGGTAAC TATACCTTAG ATACATTGGA AGACCTCGAT GGCNACGATT 600 

GAACTAAGTC C fi1 i 



(2) INFORMATION FOR SEQ ID NO: 13 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 



60 



TTGCTAAAGC AGCCATGATG ANGGCACTCC ATAAGGATAA GTACCACCAA GCAGCAGCAG 
ANGACCATAA TCTCCTTTAT GACTTGAACG AGAACGTTCA ATAATAACTT TTTCTAGTAA 120 
GGTTTGATTA ATCACTTTCA TCCTTTTTCC CTCTCACTTT TATTATACAA CAAAAAGGAG 180 
ACGCAGACCT CCTTTTGTAA TCTTATATCT AAAATTTAAT ATTCATTTCT GCCATTTTAG 240 
ATATAGCTAT AGAAAATACA CTCTATTAAT CGAATGTTTC TCTTATTTTC TATCCAATGT 300 
CCGAAGTGCT GCTTGATAAG TTTGCTCCAT CAGCATGGTA ATGGTCATAG GACCGACACC 360 
TCCAGGGACT GGCGTGATAT GGCTAGCAAG TGGTGCAACT GCCTCATAAT CAACATCTCC 
ACAGAGCTTC CCATTTTCAT CTCGGTTCAT CCCAACGTCA ATGACAACCG CACCTGGTTT 
GACAAAGTCA GCAGTCACAA ACTTGGCGCG GCCGATTGCG ACTACAAGAA TATCTGCTTT 
AGCAGCCACC TTGGCAAGAT TATGAGTTCG TGAGTGGGCC AAGGTTACTG TCGCATTTTT 
AGCCAAAAGA AGCTGAGCCA TAGGTTTTCC AACGATATTT GAACGACCGA TTACGACCGC 
ATTTTTACCT TCCAAGTCAA TCCCATATTC ATGAAACATT TCCATAATTC CTGCAGGTGT 
CGAGGGAATC ATGACTGGAT GTCCAGACCA AAGACGTCCC ATGTTTAGGG GATGGAAACC 
ATCCACATCC TTTTCTGGGT CAATGGCTAA TAAAACCGCC TCTTCATCGA TATGTTTTGG 
TAATGGCAAC TGGACCAAAA TCCCATGCCA AGCTGGATCC CTGATTATAT TTAGCAATCA 
GGTCTAACAA TTCCTCTTGA GTTAATGGTC TCTGGAACTC GCACTACTTC GCTACGGGAA 
CCAGCCGCAA GAGCTGACCT CTCCTTGTTG CGAACGTTAA ACTTGGCTGG CTGGATTATC 
CCCAACCAAA ATCACTACCA ACCAGGCACT AGAG 

(2) INFORMATION FOR SEQ ID NO: 139: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

CTGAACTGGA TTTTGCTACA AGGTCTGATT TGTCAAAGAG ATGCGAGTTA TGATATGAAG 
CAGGATGATT TGGATAAGGT AGCAGATTAT CTCTTCAAAA CAGAAGAATG GACCATGTAT 
GAGTTGATTC TTTTCGGTAA CCTCTATAGT TTCTACGATG TAGACTATGT CACTCGGATT 
GGTAGAGAAG TTATGGAGAG GGAGGAAT7T TACCAAGAGA TTAGTCGCCA TAAGAGATTA 
GTGTTGATTT TGGCCCTCAA TTGTTACCAG CATTGTTTAG AGCATTCTTC TTTTTATAAT 
GCCAACTATT TTGAGGCTTA TACAGAGAAG ATTATTGACA AAGGTATTAA GCTTTATGAG 
CGTAATGTTT TCCATTATTT AAAAGGTTTT GCCTTATATC AAAAAGGACA GTGTAAAGAA 
GGCTGTAAGC AGATGCAAGA GACCATGCAT ATTTTTGATG TGTTAGGTCT TCCAGAGCAA 
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GTTAGCCTAT TATCAGGAAC ACTACGAAAA ATTTGTCAAA AGTTAATTTT CCCAAATAAG 
GGAAAAAATA AAAAGCTCCT TTCGGTTTTG ATACAATAGT TTCAAAATTT GAGAGGAG 



540 
598 



(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 



CTACTTCCCG 


GCCGAGTCGA 


TTCTCCGCCG 


ATATCGAGCA 


TACTGGCTCC 


TTCTGTTATC 


' 60 


AATTTACGAG 


CCTGCTGGAG 


CGCCTGCTCA 


AGAGCAAAAA 


ATTGACCACC 


GTCCGAAAAG 


120 


GAGTCTGGGG 


TTACATTGAT 


AATTCCGCAA 


ATAGCTGTCT 


TTGCATGAGT 


GGCTTTACTG 


180 


GACATATCGG 


TCACTCCCTC 


AAGGCTATTC 


ATCATATTAT 


TTCTCTATTT 


TACCATAAAA 


240 


AGAAAAAGAT 


GGACACGATT 


TCATTCATCT 


TTCTCCCAGT 


AGAAACAAGT 


AAGCAATTGT 


300 


CAATAATCTT 


AAACAGAAAT 


CCCTAATGTC 


CGACTCATAA 


TCACCACAAG 


AGCCAACAAA 


360 


CAGAAAGCAA 


TCCCATTAAC 


AATCATGTGA 


AGTAAGATCG 


AC ATTTC C AA 


ACGTTGGGTC 


420 


TTGTAGGCTG 


TCCAAGATAG 


AACTGTCGAC 


ATACCTCCAT 


AAATCAATAA 


AGAAGGTAAA 


480 


TTACTTGGTT 


GATGTAATAA 


AGCAAACACA 


ATCGTACCGA 


CTACAAAT 
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(2) INFORMATION FOR SEQ ID NO: 141: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:141: 



CTATGAGAGA 


GAATGACCTT 


CTCTTGATTA 


CTGCGGACCA 


TGGAAATGAC 


CCAACGTATG 


60 


CAGGAACGGA 


TCACACTCGG 


GAATATATTC 


CATTGTTGGC 


CTATAGCCCT 


GCCTTTAAAG 


120 


GAAATGGTCT 


CATTCCAGTA 


GGACATTTTG 


CAGATATTTC 


AGCGACTGTT 


GCCGATAACT 


180 


TTGGTGTGGA 


AACTGCTATG 


ATTGGGGAAA 


GTTTCTTAGA 


TAAATTGGTA 


TAAGATGACG 


240 


CGCTATGCTT 


TGCTGGTGAG 


AGGTATCAAT 


GTTGGTGGTA 


AGAATAAGGT 


CGTCATGGCG 


300 


GAGCTTCGTC 


AAGAATTGAC 


AAACTTGGGA 


CTGGAAAAAG 


GTTGAGAGCT 


ACATCAATAG 


360 


TGGCAATATT 


TTCCTTTACT 


TCGATAGATT 


CCAAAGCCCC 


AATTGGTTGA 


AAAAGCTAGA 


420 
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GACTTTCTTT GCAGTCCATT ATCCATTTAT TCCAGAACTT TTTCCTTTAC TGAGTCTAGA 480 
GGACTTTGAA GGGCGGAACT TGAAA 505 

(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:142: 

CTATTTTCTC ATCTTTTTTA GACTGGTAAT GTATCCCCTA GAGTTTCATT GTTAAGACCT ' 60 
GTCGTATTGA CAACAGTGGA TCCTGATTCC CCATTCCAAG AGTTTTGCCC GCATCATATN 120 
GACGAAATCT TTTTCTGAGC CAGCAATTTT CTCAGCTAGG GCAATAGCGG CGCTGTTGGC 180 
ACTAGATACC AGAGTTGCTT CAAGCAACTC TTCGACAGTA TAATTACGGG CCTCCATAGG 240 
AATATTACTG GCTTCAGAAT TTGTCGTCAA TTGATAAGGA TAATCAGAAA TATCTACAGG 300 
AGTGGAGAGG GTAATACTTC CGTTTTCCAA AGCTTCATAG ACCAGATAAA CAGTAATCAA 3 60 
TTTTGTTATG GAAGCAATTT CGACAGGTGG CGTTGCATCC TTCTCATAGA GAATTTTACC 420 
AGTATTTGCC TCAACAGCAA TCGCATGTTT AGCGGCAATG GTAAAATCTT GAGCAACAGC 
AGTAGAAGCA CCCCCTAAAA GAGAGACAGT TAACAAAGTT AAAAATATTT TTTTCATAGT 
AGTCTTATTC TATCATAAAG AAAAAAAATA TTCTTGCTTT AATAATTCAT CTGTTAAGCT 
TTTTGAAAAT ATGGTAAAAT AAAGTGAGGG AGGTAACTCA TGTTTCGTAG AAATAAATTA 
TTTTTTTGGA CCACAGAAAT TTTACTCTTA ACCATCATCT TTTACCTATG GAGACAGATG 
GGGTCTTTGA TTAACCCTTT TGTTAGCGTG CTTAATACAA TTATGATTCC ATTTTTATTA 
GGGGGCTTTC TTTATTATTT GACAAACCCT ATTGTTACTT TCTTAAATAA AGTCTGTAAA 
CTCAATCGTT TGCTTGGTAT TTTAATTACC TTGTGT 



480 
540 
600 
660 
720 
780 
840 
876 



(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 521 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

CTTGTCAAAT GCCATGGAAG GGGTTCTTTA CTTCCTCAAA CCAGACTTTT CAAAACTGAC 60 

142 



WO 97/37026 PCT/US97/05306 

TAGCGCTGAT CTCCTCTATG CTCTGGGACA ATCTTTCTTT GCCCTCTCAC TAGGGGTTAC 120 

AGACATGTTG ACCTATGCTT CTTACTTGGA CAAGAAAACC AATCTAGTCC AGTCAGGAAT 180 

CTCCATCGTA ACCATGAATA TCTCGATAGT CCATCATGGA AGGTCTAGCC ATTTTCCCAG 240 

CCATGTCAGC CTTCAATATC CACTCTGAAA GGGGACCCAG CCTGCTCTTT ATCGTCTTGC 300 

CTCAACTCTT TGACAAGATG CCTTTTGGAA CCATTTGCTA CGTCCTCTTC CTCTTTGCAA 360 

CTGTCACTTC TTCTGTCGGT GATGCTGGAG ATCAATGTGG GCAATGTCAC CAACCAGGAT 420 

AACAGCAAAC GTGCCAAATG GAGTGTTATT TTAGGAATTT TGACCTTTGT CTTTGGCATT 480 

CCTTCAGCCC TATCTTACGG TGTCATGGCG GATGTTCACA T 521 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 



CTGCAGACAA 


TAACTTGGTT 


AGCTTAACGG 


CTCTTGAGGA 


TTCATCCAAG 


GATGTAACCT 


60 


TTACCAGTTC 


GGCTTTCAAT 


CTAAAAGAAG 


GGCGACACCT 


TCAAAAAGGG 


GATTCCAAGA 


120 


AAATCCTTAT 


CCACGAAGAA 


TTGGCTAAGA 


AGAACGGTCT 


TTCACTTCAT 


GACAAGATTG 


180 


GCTTGGATGC 


TGGTCAGTCT 


GAATCTGGAA 


AAGGACAAAC 


AGTAGAGTTT 


GAGATTATCG 


240 


GCATCTTTCC 


TGGTAAAAAA 


CAAGAGAAAT 


TCACAGGCTT 


GTCTTCTGAC 


TTCAGTGAAA 


300 


ATCAAGTCTT 


TACAGATTAT 


GAAAGTAGCC 


AAACCCTTTT 


GGGCAATAGT 


GAAGCTCAAG 


360 


TCAGTGCAGC 


ACGCTTCTAT 


GTTAGAAAAT 


CCTAAGGAAA 


TGGACGGACT 


CATGAAGCAG 


420 


GTAGAAAACT 


TGGCCTTGGA 


AAATCAAGGC 


TACCAAGTCG 


AAAAGGAAAA 


CAAGGCTTTT 


480 


GAACAAATCA 


AAGACTCAGT 


TGCAACTTTC 


CAAACCTTCC 


TGACCATCTT 


CCTTTATGGA 


540 


TGTTGATAGC 


AGA 










553 



(2) INFORMATION FOR SEQ ID NO : 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:145: 
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CTATGGATGC 


GGAAGTTTCT 


AAGAACCTTC 


GCTTGATCCT 


TGAGCGTAAA 


GGAATGACCA 


60 


TCTTGACTGG 


TACTAAACTG 


CAAGAAATCA 


TTGAGGAAAA 


TGGTCAACTT 


CGTATCAAGG 


120 


TTGAAGGAAA 


AGACAATATC 


ATCGCAAGCA 


AAGCTCTTCT 


TTCAATTGGT 


CGTATGCCAG 


180 


ACCTTGAAGG 


TATTGGAGAG 


GTTGAGTTTG 


AATTGGATCG 


TGGTTGTATC 


AAGGTCAATG 


240 


AATACATGGA AACTTCAGTT CCACGCATTT 


ATGCCACCAG 


GTGACATCAA 


CGGTACTAAG 


300 


ATGTTGGCTC 


ACGCAGCTTT 


CCGCATGGGT 


GAAGTTTCCG 


CTGAAAATGC 


CCTTAAAGGA 


360 


AATCATGCAG 


TTGCCAAATT 


GAATTTGACT 


CCTGCAGCCA 


TCTACACTCT 


CCCTGAAGTA 


420 


GCAGCAGTAG 


GTTTGACAGA 


AGAACAAGCC 


CGTGAGAAAT 


ACGATGTTGC 


CATCGGTAAG 


480 


TTTAACTTTG 


CTGCTAACGG 


TCGTGCTATT 


GCATCTGACG 


CAG 




523 



(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 



CTGCAGGTGG AATTATGAAT 


GTCGGATATG 


AAAAAGCATT 


CTTGATGCAG 


ACATCGTTAA 


60 


ATTTGCCAAC TTCTGAAATT 


ATCTCGACAT 


ATGTCTATAA 


AGTTGGTCTT 


GTATCAGGAG 


120 


ACTATTCTTA CTCAACAGCG GTTGGTTTGT 


TTAATGCAGT 


GATTAACGTA 


GTATTGCTTG 


180 


TTGCAGTTAA CCAAATCGTT 


AAACGCATGA 


ATAATGGTGA 


AGGAATTTAA 


GGAGGAAAGT 


240 


ATGAAAAATT CGATTATGGA 


TACAAAATTT 


GATAGACGTA 


TCTTACTCTT 


AAATAAAATC 


300 


ATTATTGTCT TTATCGTTTT 


GATGACTTTG 


CTTCCTTTAC 


TTTATATCGT 


CGTAGCATCC 


360 


TTTATGGATC CTAAGGTTCT 


GGTTAGTAGA 


GGGATTAGCT 


TTAATCCAGC 


CGATTGGACT 


420 


GTAGAAGGTT ACCAGCGTGT 


TATTCAGTGA 


CCAATCTATT 


CTAAGAAGTT 


TTATCAATTC 


480 


CCTACTATAC TCTTTTGGAT 


TTGCAGCT 








508 



(2) INFORMATION FOR SEQ ID NO: 147: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 147: 
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CTAAAAAGAA 


CCAGATGGAT 


CTTGTGATGA 


ATTTACATCA 


TTTTGATTTA 


CCAGTGGAAC 


60 


TTCTTCAAAA 


ATACGGTGGT 


TGGGAAAGCA 


AACATGTAGT 


GGAGTTATTC 


GTGAAGTTTG 


120 


CC AAGACTGC 


TTTCAACATG 


CTTTGGAGAT 


AAGGTTCATT 


ACTGGACAAC 


TTTCAATGAG 


180 


CCAATGGTCA 


TTCCAGAAGC 


AGGATACTTA 


TATGCTTTCC 


ATTATCCAAA 


TCTAAAAGGA 


240 


AAGGGAAAAG 


AGGCCGTACA 


AGTCATCTAT 


AATCTAAACC 


TTGCTAGTGC 


AAAAGTGATT 


300 


CAACTATATC 


GCTCATTAGG 


ACTTGATGGA 


AAGATTGGGA 


TTATTTTAAA 


CTTGACACCT 


360 


GCTTATCCAA 


GAAGTAATTC 


TCCAGAAGAC 


TTAGAAGCAA 


GTCGATTTAC 


AGATGACTTC 


420 


TTTAACAAAG 


TCTTCCTTGA 


ATCCAGCTGT 


TAAAGGAACT 


TTCCCAGAAA 


AGATTGGTAA 


480 


AAACAGCTAG 


AGAGAGATGG 


CGTGTTATGG 


AGTCATACCG 


AAAAAGAG 




528 



(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 



ACTCTTATTG 


GTGAGGTATC 


GCAACTTCTG 


TTTCTGCCTT 


CCTTGGTGGA 


CCAGCCAATA 


60 


CAACTTACGG 


AGAAAATACA 


GGGGTTATCG 


GTATGACTCG 


TATCGCTTCT 


GTCTCAGTTA 


120 


TCCGTAACGC 


TGCCTTCATC 


GCGATTGCCC 


TCAGCTTCCT 


TGGTAAATTC 


ACTGCCTTGA 


180 


TTTCAACTAT 


TCCAAACGCT 


GTACTTGGTG 


GTATGTCAAT 


CCTTCTCTAT 


GGGGTTATCG 


240 


CCAGCAATGG 


TTTGAAAGTC 


TTGATTAAAG 


AACGTGTGGA 


TTTCGCTCAA 


ATGCGAAACC 


300 


TCATCATCGC 


AAGTGCTATG 


TTGGTTCTTG 


GACTTGGGAG 


GAGCTATCCT 


TAAACTTGGT 


360 


CCAGTTACAC 


TTTCAGGTAC 


TGCCCTTTCA 


GCCATGACAG 


GAATCATCTT 


GAACTTGATC 


420 


TTGCCATACG 


AAAATAAAGA 


CTAAGAGTCT 


AAATACACCT 


AATCCACTCA 


GACAGCTGAG 


480 


TGGATTTTTC 


GTATACCATA 


ATAAAAGTGT 


CTTAACAAAA 


TTATTAAAAT 


CAAAAAACGT 


540 


ATAATATCAG 


ATATTCTAAA 


ACCTTGATAC 


TGTACGTTTT 


ATCATAGAAA 


TTTTTACTTT 


600 


ATTTTCTCAT 


CAAATGAGAT 


TTGCATCAAT 


CTCTTGTCTT 


ACTTGCGTTT 


CTTCTTCGCT 


660 


TTCTTCATTT 


TGTTAGCCAT 


ACGTTTCATG 


GACTGTTTCA 


TGGCAAATTC 


ACCAATTTTA 


720 


CCTTTCAAAC 


CGCCACCAAA 


CATCTGGCTC 


ATATCTGGCA 


TTCCTGCTCC 


TCCGAGAG 


778 



(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:149: 





TGCTATTNGT 


GAGGGAAAAG 


GTTTGAACTT 


TGAAATCGAA 


GTGGATGGTG 


60 


GGATTGATGA 


CCAAACTATT 


GCTCAAGCCA 


AAGAAGCTGG 


TGCGACTGTT 


TTTGTAGCAG 


120 


GTTCCTATGT 


CTTTAAGGGA 


GAAGTCAATG 


AGCGAGTACA 


AACTCTCAGA 


AAACAACTGG 


180 


ACTAGGGTTG 


CAGTTTTTGC 


AGGCGGAAAC 


CGCGGTCATT 


ATCGGACAGA 


TTTTGATGCT 


240 


TTTGTTGGGG 


TGGATCGAGG 


CTCGCTCTGG 


GTCTTGGAAG 


AAGACTTACC 


TCTTGCTCTA 


300 


GCAGTCGGAG 


ATTTTGATTC 


TGTGACGGAA 


GAAGAGCGAC 


AGGTGATTCA 


AAAACGTGCC 


360 


CAGTATTTTG 


TCCAAGCCCG 


GCCAGAAAAA 


GATGATACTG 


ATTTGGAATT 


GGCTCTCTTA 


420 


ACCATCTTTG 


AACAAAATCC 


TCAGGCTGAG 


GTCACTATTT 


TCGGTGCCTT 


GGGTGGCCGT 


480 


ATTGACCATA 


TGTTGGCCAA 


TGTTCTTTCT 


ACCTAGCAAT 


CCCTAAGTTG 


GCACCCTATA 


540 


TGCGTTCNAA 


TAGAAATTGA 


GGAATGGGCA 


AATTTGATTG 


CCTATTGTTC 


CAGAAGGGAA 


600 


CAGTCAGATT 


CGAATCCCTG 


CA 
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(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 



CTATTATCAG 


CACTTTTATA 


CCTATTACCA 


AAAAAATCCT 


CAAAATGATT 


GGCAAAAGGC 


60 


TTTATCCACC 


AGCCATTTAT 


CAGCAATATT 


TACTTGAAAA 


ATATGGTAGA 


AATAGAAAGG 


120 


ATGGAGGAAT 


CTAATGGTAT 


TACAAAGAAA 


TGAAATAAAT 


GAAAAAGATA 


CATGGGATCT 


180 


ATCAACGATC 


TACCCAACTG 


ACCAGGCTTG 


GGAAGAAGCC 


TTAAAAGATT 


TAACAGAACA 


240 


ATTGGAGACA 


GTAGCCCAGT 


ATGAAGGCCA 


TCTCTTGGAT 


AGTGCGGATA 


ACCTACTAGT 


300 


AAATCACTGA 


ATTTTCTCTT 


GAAATGGAAC 


GCCAGATGGA GAAGCTTTAC 


GTTTATGCTC 


360 


ATATGAAGAA 


TGACCAGGAT 


ACACGTGTAA 


GCTAAGTATC 


AAGAGTACTA 


TGCCAAGGCC 


420 


ATGACACTCC 


TACAGCCAGT 


TAGACCAAGC 


CTTTTCATTC 


TATGATCCTG 


AATTTATGGA 


480 


TATTAGCGAA 


AAGCAGT 










497 



(2) INFORMATION FOR SEQ ID NO: 151: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 



CTGTTATGGA 


GTTGAAAGAA 


GAATACCACC 


CAGACGGTAA 


AGGTTTTGAA 


ATGATCTGTA 


60 


TCGGTGGGAT 


GGGAGCTGAT 


TTCTTTAAGG 


CTCGCGGTAT 


TCAACCACTT 


TATGAATTAC 


120 


GTGGCTTGTC 


AGACCAACCT 


AGCTTTGATC 


AAGTTCGTAA 


GATTATTTCA 


AAAACTGTTG 


180 


AAATGTACCA 


AAATGAACTC 


TTTGATGAGC 


TTTATGTTTG 


CTACAACCAC 


CATGTCAATA 


240 


CGCTAACCAG 


TCNAATGCGT 


GTGGAACAAA 


TGCTTCCGAT 


TGTTGACTTG 


GATCCAAATG 


300 


AAGCGGATGA 


AGANTACAGC 


TTGACTTTTG 


AAATTGGGAA 


ACCAGCCGAG 


AAGAAATTCT 


360 


GGAGCAGTTG 


TTGCCTCAGT 


TTGCAGAAAG 


TATGATTTAC 


GGTGCCATTA 


TCGATGCCAA 


420 


GACAGCTGAG 


AATGCTGCGG 


GTATGACAGC 


CATGCAAACA 


GCGACAGATA 


ATGCTAAGAA 


480 


AGTCATCAAT 


GATTTGACAA 


TTCAGTATAA 


CCGTGCCCAG 


ACAGGCGGAT 


ATTACACAAG 


540 


AAATTACCGA 


AATCGTAGCA 


GTGCCTAATG 


CCTTAGAATA 


GCTCTAGTCC 


CAGCTCTCCT 


' 600 


CACTC 












605 



(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO;152: 



CTACACTTGA 


GGGAAGTTTG 


AAAATTCCAT 


TTTTCTTAAA 


GGGAATTGTG 


AAACGCTATG 


60 


TATTTTCTTT 


TTACAACCGG 


ATGGAGCACT 


TGGTTGTGGT 


CAATCCTATG 


TTTATTGAGG 


120 


ATTTGGTAGC 


AGCTGGTATT 


CCACGTGAAA 


AAGTGACCTA 


TATTCCTAAC 


TTTGTCAACA 


180 


AGGGAAAAAT 


GGCATCCTCT 


ACCACAAGAA 


GAGGTAGTCA 


GACTGCGCAC 


AGATCTTGGT 


240 


CTTAGTGACA 


ATCAGTTTAT 


CGTAGTAGGT 


GCTGGGCAAG 


TTCAGAAACG 


TAAAGGGATT 


300 


GATGACTTTA 


TCCGTCTGGC 


TGAGGAATTG 


CCTCAGATTA 


CCTTTATCTG 


GGCTGGTGGC 


360 


TTCTCTTTTG 


GTGGTATGAC 


AGATGGTTAT 


GAACACTATA 


AGAAAATTAT 


GGAAAATCCC 


420 


CCTAAAAATT 


TGATTTTTCC 


AGGCATTGTA 


TCGCCAGAGC 


GGATGCGCGA 


AATTGTATGC 


480 


TCTAACGGAT 


CTTTTCTTGT 


TGCCTAATTA 


CAATGAG 






517 



(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 798 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 



60 



TGTTCGCCTC CATATCCNAA ACTTGCCCTG CCACCATACC CAGACTACCT GAAGCAAGGG 

ATAAGTTGGC AATCGAGCCC ACCTNGATCT GACTTGGCGA ATCTGCCTGC GCAATCAAGG 120 

CATATGGGTC TAAGAATAAG GATCTCCAGC CAAAATGGCC ATAGCTTCAC CGAATTTCTN 180 

GTGATTGGTT AACCGCCCTC TTCGATAATC GTCATCATCC ATAGCAGGAA GGTCATCGTG 240 

AATCAAGCTC CCTGTATGAA TCATCTCCAA GGCAGTAGCT ACCTGCGCGN GAGCAGGTTT 300 

GATGGTAACC TGCAAGGCTT CCAGAACTTC TAACAAGAGA AAAGGCCGAA TACGCTTGCC 360 

ACCAGCATGA ATAGAATAGA GAACAGACTC CCGTAAACTA GAGGCAAACT GCTGGTCTCC 420 

ATAAAATCTT CCAAAGCCGA CTCGACAAGA GCTAATTTTT CTTGCTTTTT CATTCAAAAT 480 

CACTTTCTGT TCCGTCTTCT TGCATGACCT TGACCAAGGT CTTTTCAGCC TTGTCCAGCG 540 

TAGCTTGGAG CTCTTTTGAC AAGACCATGC CCTTTTGAAA GGCAGTAATC GCATCTTCCA 600 

GAGCAATTTC ACCATTTTCC AAACTTTGGA CAATGGTTTC CAGTTCTGCT AGATTTTCCT 660 

CAAATTTCTT TTGTTTTGAC ATCTTTAACC TCTAATTCTA CTTGACCATC TCGCATCAAA 72 0 

AGCGTTACTT GGTCTTTTTT CTTCAAACTC TCAACCGAAT CTACAACGGA TCTTCTTTTT 780 
TGACAATAGC ATAACCAC 



798 



(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

CCATCTTTGC GGCAGTGACA GCAGGTTATC AGGCTGCCCT AATGGTACCA ACAGAAATCC 
TCGCAGAGCA ACACTTTGAG AGTTTACAGA ACCTTTTTCC CAATTTGAAA CTGGCTCTCT 
TGACAGGTTC CTTGAAAGCT GCAGAAAAGA GAGAAGTCTT GGAGACCATT GCCAAGGGTG 
AGGCTGATTT GATTATAGGA ACTCACGCTC TGATACAAGA TGGGGTGGAG TATGCTCGTC 
TTGGTTTGAT TATTATCGAT GAGCAGCACC GTTTTGGTGT AGGGCAAAGG CGTATT1TAC 
GGGAAAAAGG CGACAATCCA GATGTCCTCA TGATGACGGC GACTCCCATT CCACGGACGC 
TTGCCATCAC AGCCTTTGGA GATATGGATG TTTCCATTAT CGACCAGATG CCAGCAGGTC 
GGAAGCCCTA TTGTGACGCG CTGGATCAAA CATGAGCAAC TACCTCAGGT CTTGACTTGG 
TTAGAGGGGG AAATTCAAAA AGGTTCCCAA GTCTATGTCA TCTCTCCTTT GATTGAAGAA 540 

148 
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(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 



CTCTTTTAAC 


CGTTTTAGCG 


GTGACACCGA 


GGTATTTTTT 


CAGGACCCAA 


GACTTGTCGG 


60 


GCAACCGAAA 


CTGGGAGTTC 


GTCATCTCCA 


ATATGCAGAC 


CAGCAGCATC 


AACCGCAAGA 


' 120 


CAAACATCCA 


ACCGATCATC 


GATTATCAAG 


GGGACCTGAT 


AGGCATCTGT 


TATTTCCTTG 


180 


ACTTGTTTTG 


CCAGTTGATA 


ATATTGATTG 


GTTGTGAGAT 


TTTTTTCTCG 


CAATTGGACT 


240 


ATGGTAACCC 


CTGAACGGCA 


GGCCGTCTCA 


ACTTTTGCAA 


GAAAGCTTTC 


CACGGAATCT 


300 


TGATAGCGAT 


TGGTTACCAG 


ATATAGTCTA 


AGCGCTTCTC 


TATTCATAAA 


CCTCTCCTTT 


360 


GATGGTATCT 


AGCCAATTTT 


CATCTCTTCT 


TAGGAGCGAA 


AGCTGATTGA 


GTACTTGGTA 


420 


ACGAAATTCT 


TCCAATCCCA 


TTCCTTGAAC 


AACTATTTTC 


TCAGCCAGCG 


ATATTGAGAT 


480 


AAGAGACTGC 


TAAGCAAGAA 


CTTCAAAACC 


AGTCTTTCCT 


TGGCTGAGAA 


AAACAG 


536 



(2) INFORMATION FOR SEQ ID NO: 15 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 



CTGGAGTGAC 


GATTGAGAAA 


ATCTTTGCCC 


TAGAAAATTA 


TCGAAATCAG 


TTAGCTGCTT 


60 


TTCCGCAAAC 


TGTCTGGATT 


TCAGAGGATA 


TTTTGCTAGA 


TTTGGCGGAT 


TCTCAGACTC 


120 


CACAGGGAAT 


TGTTGCCGTG 


GTTCAAAAAG 


AAGAAGTAGG 


ACAAGCTGAT 


TTGAGTCAGG 


180 


GCAAGTTCTT 


GTTTTTGGAA 


GATGTGCAAO 


ATCCTGGTAA 


TGTAGGAACT 


ATCATTCGAA 


240 


CTGCGGATGC 


AGCAGGTTTT 


ACAGGAGTGA 


TTGTTTCAGA 


TAAGTCGGCA 


GACATCTACA 


300 


GTCTTCAAAA 


CCCTACGTTC 


CATGCAAGGT 


AGTCATTTCC 


ATCTGCCCAT 


TTACCGGATG 


360 


ACTAGTCAAG 


CGCTTCTTGA 


CGAAACTAAA 


AAGGTAGCTA 


TCCCAGTGCT 


AGCAACAACC 


420 


CAATCTAAAG 


ATTCTGTTGA 


TTACAGAGAA 


CTGCCTTCTA 


TAGAAAATTT 


TGTACTAGTT 


480 
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ATGGGAAATG AGGGTCAAGG AATTAGTCCC CTTATGGCTG AAAGTGCAGA CCAGTTGGTC 540 

CATATTAGCA TGAAGGGGCA GGCCGAGAGT TTGAATGTTG CGGTTGCAGC CGGTATTTTA 600 

ATCTTCCATT TAAGCTAATT TTAACTTTTT TTGTTATAAT CAAGGAAAGA TGTTCACAGA 660 

AAAGGAGAAA TGGATGAATC ACACTATTAT ACATGACCGT GCAGGTCTCA ATCAATTTTA 720 

CGCTAAGGTT TATGCCTTTG TTGGTCTGGG AATCGGACTA TCTGCTTTGG TATCAGGCCT 780 

TATGTTGACG GTCTTTCAGT CTCAGTTGGT TTACTTTTTG ATGCAGGGGC GTCTCTGGTT 840 

GACCATTGCT ACTT 854 

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 



CTGAAGAAAT 


CCTAAAAGAT 


TTGTATTACG 


GAGCCGCTAA 


GAAAATTCAA 


GAAGAAGCCT 


60 


CTCGCTGGGC 


AGGAGTTGTA 


AGAAATGACT 


AAAACAGCCT 


TTTTATTTGC 


TGGTCAAGGT 


120 


GCCCAGTATC 


TACGGATGGG 


ACGGGATTTC 


TATGATCAGT 


ATCCGATTGT 


CAAAGAAACG 


180 


ATTGATCGAG 


CGAGTCAGGT 


GCTCGGTTAT 


GATTTACGTT 


ATCTCATCGA 


TACGGAAGAA 


240 


GACAAACTCA 


ATCAGACCCG 


CTATACGCAA 


CCAGCCATTC 


TAGCGACTTC 


GGTTGCTATC 


300 


TACCGTTTAT 


TGCAAGAAAA 


GGGCTATCAC 


CCTGATATGG 


TTGCTGGTTT 


GTCTCTTGGA 


360 


GAATACTCTG 


CCTTGGTGGC 


AAGCGGCGCC 


TTGGATTTTG 


AAGATGCCGT 


TGCCTTGGTA 


420 


GCTAAGCGTG 


GAGCCTATAT 


GGAAGAAGCG 


GCTCCTGCTG 


ACTCTGGCAA 


GATGGTAGCA 


480 


GTTCTCCATA 


CCCCGTAGAG 


TCCTTGAGAA 


GCCTGTCCAA 


AGCTCTGACT 


TGAGTGGTAC 


540 


TCCAGCCACT 


ATACACACTG 


CACAATCTCA 


TTGCTGGAAA 


ATTGTTGCAG 


TGATCAACGG 


600 


TGACTTTGCA 


GAGCAGTGCC 


AACCCTATCC 


CCTAGT 






636 



(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 
JA) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 



150 



WO 97/37026 PCT/US97/05306 



CTCTAAAACG 


AATTCTCTAC 


AAGCACCGCA 


AGGGCATGGC 


TGAACTTCCA 


CCATAAGGTG 


60 


GTTTGTCTCG 


AAAGGCTAAT 


ACTTCNTTAA 


CCTTAGTTTG 


TCCTGAAAAT 


TGGTACATAT 


120 


TGAAGAGGGC 


CGCCCGTTCT 


GCGCAGAGAT 


GGAAAACACC 


ACAGGTTCCC 


TCCATACAGA 


180 


ATCCTGTAAA 


TATTTGTCCA 


TCTCCTGCTT 


CTACTGCAGC 


TACAACATGA 


TTGGCATAAA 


240 


CAAAGTCTGA 


TACTTCATGT 


GGATTGTATA 


GTTTCTGTGC 


TTCTTCCTAC 


ATCTTTTCCC 


300 


AGATGTCCAT 


TATTGTATCC 


TCTATATTTA 


GAGATTTCTT 


TTAGAATGTT 


TTCGATATGC 


360 


TGAATTGATT 


TTTCACGTCC 


AAGCAAGAAA 


ATTGTATCTG 


GTAATTCTGG 


CCCATGCATT 


420 


TCGCCTGAAA 


CTGCGATACG 


AATAGGCATG 


AAAAGATTTT 


TCCCTTTAAT 


ACCTGTTTCT 


480 


TTTTGGACTG 


CTTTAATTTG 


CGGGAAGATA 


TTTTCTGTCA 


CAAATTCATC 


ATCTGTCATC 


540 


CGCTTCAATT 


TTTGCTTTGA 


ATGCTTCAAG 


AACTGTTGGA 


ACTGTTTCAC 


CCGTCATGAC 


600 


TTCGCGCTCT 


GCTTCTGTCA 


ATTCCTGGGA 


AATCTGAGAA 


GAAAGATCTG 


TCCATGGGAT 


660 


ATCTCATCTA 


CTGATTCATT 


GTGGTTTATA 


GAG 






693 



(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 



CTGACAAAGG 


AGACGGGTAT 


GGAACAAACA 


TTCTTTATCA 


TCAAACCAGA 


TGGTGTAAAA 


60 


AGAGGGCTAG 


TGGGTGAAGT 


GTTAAAGCGC 


ATCGAACAAC 


GTGGATTTAC 


AATCGAAAAA 


120 


TTGGAGTTTC 


GTTCACAGGT 


TTTCAGAAGA 


GTAGATTGAC 


CAGCACTATC 


AGGACCTGGT 


180 


TGGTCAGAGT 


TTTTACCCAC 


CGATTCGTGA 


ATTCATGACT 


TCAGGTCCAG 


TTCTTGTGGG 


240 


TGTCATTTCT 


GGTCCCAAAG 


TAATCGAAAC 


TTGGCGGACC 


ATGATGGGTG 


CAACTCGTCC 


300 


AGAAGAAGCT 


TTACCAGGCA 


CTATTCGAGG 


TGATTTTGCA 


AAAGCTGCTG 


GAGAAAATGA 


360 


GATTATCCAA 


AATGTTGTAC 


ATGGTTCAGA 


TTCCGAGAAG 


AATCAGCTAA 


GCCGAGAAAT 


420 


TGCTCCTTTG 


GTTTTAAGAG 


TGGATTGGCT 


CAATCAATTG 


GATAAAAGCT 


CATTTGAATA 


480 


GAAAGTATAG 


TCAATTAGTT 


TAAGACATGA 


CGCATGATAT 


CAAACTTTTT 


AGTTTTTGAT 


540 


ATGGTGCGTT 


TTT 










553 



(2) INFORMATION FOR SEQ ID NO: 160: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:160: 



CTAGAGATTT NCGAAGAGTA TTATTGTACT TTAAAGGTCT TGAGATAATT GTCTTTNCCT 60 

ACTTGACCTT CGAAGGTTTT ACCATTTTCA AGGTAAGGAA GGTCATCGGA TACTGAAGCC 120 

TTGACCTTGT ATATCTTGCC ATCAACTTTA AAGAAGTAGA CAGTGTCTCC CTTGATAACA 180 

GCTGATTTGA GGTCTGCTAC TACTCCCTTG ATGCTTTCTG TCGTTGCATT GTCAATTTCA 240 

AGGTCGTTTT TATTGGCATA CTTGCTGAGC ATCTCTTCCA CTGTAGTAGC AACGATAACA 300 

TTTTGGTACT CGACTGCGTC TACCAGGGCG TACTCTTTGA CCAAGCCAGC ATTGTCCTTT 360 

AAGCCCATGA TGTAAAGAGG CTTGTCATTG AGGTTGATAA AGATTGGGGA AAGGTTGCTT 420 

TGTAGGATTT CTCCTGAACA GCACCTTCTG CTGATTCACG GGCTGATTCT TCTGTCGCAG 480 

AAGCCAAG 488 



(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 543 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

TACAGCTATC GGGTCCGTCT GGTAGTTGGT CCTCAGAACC TTCACTGCCA CTTCTTCCCC 60 

ATCTAAGATT AAGTCTTTGG CTAGGTAGAC ATCCGCCATG CCTCCTCGAC CAATCTGTTT 120 

GACAATCCGA TAGCGTCCGG CAAAAATCTT GCCGATTTGG ATCATTCTTC ATCCTCCTCG 180 

TTCATAGAAA CAAGGGCAAC CGTAATGTTG TCTAAACCTC CTGCATTGTT AGCAAAACGA 240 

ACAAGTGTCT CCGTTTTATC TGCTAAAGGA ATATCACTGG TTACAATATC ACGAATCTCA 300 

CTGCCTGAAA TCATGTTGGT CAAGCCGTCA CTATTGAGCA AGAGATAGTC ACCTGACTCA 360 

AGGATAACTG TCCCAAAATC AGGCTGAATT TCATCTTTTT GCCCAATAGA CTGGGTGATA 420 

ATATTTTTTT GTGGATGACT TCTGCCTCTT CTGGTGTCAA TTGACCAGCC TTGAGCAATT 480 

CATTAACCAA AGAATGATCG CTCGTCAACT GATGGTATCT TCTCCACGAA TCAAGCCGAT 540 
ACG 

(2) INFORMATION FOR SEQ ID NO: 162: 



543 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 504 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DMA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 



CTTTAAGGAA AATCAAATCT 


CTCATGCTGA 


TACCTCTCCT CATTAAATTA AATAGTAAAA 


60 


AAGATTCTAT 


CTCACTCCCT 


GATTATTACA 


AAACCATTGA 


AATATCACAA CTAATAGGCT 


120 


AGAATGGACA 


TAGTAAGATN 


TAGTAGATGA 


GTCATTCTAC 


TCAAATCCAC GTTAGAAAGG 


180 


ACTGCTATGC 


CAGACAATCT 


CGCGCTTCGC 


ATGCGCCCTA 


AAACCATCGA CCAGGTCATC 


240 


GGTCAGGAGC 


NTCTGGTCGG 


ACCTGGAAAA 


ATCATCCGCC 


GCATGGTGGA AGCCAACCGC 


300 


CTGTCCTCCA 


TGATTCTATA 


TGGCCCTCCT 


GGAATCGGCA 


AAACCAGTAT TGCCTCTGCC 


360 


ATCGCTGGAA 


CGACCAAGTA 


TGCCTTTCGA 


ACTTTCAATG 


CGACAGTTGA TAGTTAAAAA 


420 


GCGACTGCAA 


GAAATCTCGG 


AAGAAGCTAA 


ATTTNCTGGT 


GGTCTCGTCC TATTGCTAGA 


480 


CGANATTCAT 


CGACTAGATA 


GACC 






504 



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 



CTAATAGAGG 


CGAAAAAATT 


TCAGGTCCTC 


CTTTGCTAGA 


TGATAATGAG GAAACTAAGA 


60 


TTTTACCAAC 


CTCTTCTTCC 


CGTTTTGGTT 


ATGCCAATCC 


TAAGGATCAT 


GGTTTTAGCC 


120 


AGGAAACCTT 


GAAGATTCAG 


GTCGAACCAT 


CTATTCATAA 


AAGCCGTCGT 


ATTGAAAATA 


180 


CCAAGAGAAA 


TGTCTTCAAT 


TCTAAGTTGA 


ATAAAATCTT 


ATTTGCGGTC 


ATCTTTCTCT 


240 


TGATTTTGCT 


TGTTTTAGCA 


ATGAAACTTT 


TGTAATAGAA 


AAGGAATTGA 


AATGAAAATA 


300 


GGAATTATTG 


CTGCTATGCC 


AGAAGAACTG 


GCTTATCTGG 


TCCAGCATTT 


AGATAATGCC 


360 


CAGGAGCAAG 


TTGTTTTGGG 


GAATACCTAT 


CATACAGGAA 


ACCATTGCTT 


CTCATGAAAG 


420 


TCGTTCTTGT 


TAAAAAGTGG 


AATTGGTAAG 


GTCATGTCTG 


CTATGAATTG 


TGTGGCGAAT 


480 


TTTGGCTGAT 


CATTTCCAGG 


TTGGATGCCC 


TTATTAATAC 


GGGTTCAG 




528 



(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE : Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 



60 



CTAATAAGAC TGAAAAGAAG TCCAGCCAGA TGTCCCTAAA AATACAGAAA AAACATTAAA 

ACCAAAGGAA ATCAAATTTA ATTCTTGGGA AGAATTGTTA AAATGGGAAC CAGGTGCTCG 120 

TGAAGATGAT GCTATTAACC GCGGATCTGT TGTCCTCGCT TCACGTCGGA CGGTCATTTA 180 

GTCAATGAAA AAGCTAGCAA GGAAGCAAAA GTTCAAGCCT TATCAAACAC CAATTCTAAA 240 

GCAAAAGACC ATGCTTCTGT TGGTGGAGAA GAGTTCAAGG CCTATGCTTT TGACTATTGG 300 

CAATATCTAG ATTCAATGGT CTTCTGGGAA GGTCTCGTAC CAACTCCTGA CGTTATTGAT 360 

GCAGGTCACG TAACGGGGTT CCTGTATACG GTACACTCTT CTTCAACTGG TCTAATAGTA 420 

TTGCAGATCA AGAAAGATTT GCTGAAGCTT TGAAGCAAGA CGCAGATGGT AG 472 

(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

CTTTTAAATA AGAAGAATCA CACTCATTGC TAAATGTGAT ATAATATTGT TCTCAGTCTC 
AGCAGTGCGT TGTGAAAAAG CCATACTAAT CCCTGATTTT TCCGCATATT CCATCACCTC 
AAGTGTTTTA CTCACTGAGC CAACTTGATT AGGCTTTATT ACAATACCAT CAGCACAATC 
TTTAAACTGT GAAATTCTTT CTAAGTTTGT TGCGTAAAAA TCATCACCAA AAACTTGTAA 
CTTTAGAGGT TTTATCAATT GAAATTTTCT CCAACTATCC AAATCTTCAT CGGAAAATGG 
ATCTTCAAGA TAAACCAATG GATACTTAAC TCCCCAATCA CAATAAGTAT CCATAATTTC 
TGTGACTGTC TGTTGCTGAC TAACACACCA TGGTACTTTA TATATGCCAA GTGAAGAATC 
GTAACGATCT GTCATTGCTA AATCTAAACC GATATCATAT CTATTTTTGT ATAATTTTAA 
TGTCTCTAAT AACGAATCCA AAATAATATA AAAATCATCC GTGTTAATTA TTAATGCACC 
TTGATAGAAA TAGATGTTGT AGAAACCTGG GTTTTAACTT TATCATAACT CATTTTTATG 
TTAGA 

(2) INFORMATION FOR SEQ ID NO: 166: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
605 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

CTGAAGAAAA TCAGTAAACA AGAGCTGGTT AACACGCGCT TTTCTCGTTT ATTTGCTCAT 60 

TTTGGACAGG AAAAAGACGG TAGTTTTCTT GCCCAGCGTT ACCAATTTTA CCTCGCCCAG 120 

CAGGGACAAA CACTATCGGG CGCTCATGAT CTCTTGGACA GCCTCATTGA GCGTGATTAT 180 

AACTTGTATG CTGCGACAAA TGGCATTACT GCCATTCAGA CAGGACGTTT GGCTCAATCT 240 

GGTCTAGCAC CTTATTTCAA TCAAGTCTTT ATCTCAGAAC AGTTGCAAAC TCAAAAGCCG 300 

GATGCTCTTT TTTATGAAAA GATTGGCCAG CAAATTGCTG GATTTAGTAA AGAAAAGACC 360 

TGATGATTGG AGATTCTCTA ACCGCCGACA TTCAAGGTGG CAATAATGCG GGGATTGACA 420 

CTATCTGGTA TAATCCTCAT CACCTCGAAA ATCACACACA AGCCCAGCCG ACTTACGAAG 480 

TCTATTCTTA CCAAGACTTG CTGGATTGTT TAGATAAAAA TATTCTTGAA AGATCACGTT 54 0 
TTAAGGAGAT AG - 552 

(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE ; Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

CTTTGTCTGT ATCTGCCTTT TTATGCTATA CTTAAGGTAT GCATAGAAAA ACAGTGATTG 60 

ATTNTAGGAC TTTAGGGGAG AGATATACCT TTACCCANCC TATTAAAGAG TTGAAAACGA 120 

GAAATGTAGC AGAAGTGGCA GATTTGCTGG CACNAGTGGA AAGCTACCAA GAGCAAGATT 180 

ATTATGTGGT GGGGTATGTC AGCTACGAGG CTGCACCTGC TTTTGAGGAG AAATTAGCAG 240 

TTCACAAGGT TCCTCTACTG GGCGAGTACT TGCTTTACTT TACTGTTCAC GATAGGGTGG 300 

AGACCTCCCC TATTCCTCTG ACTTATGAGG ATATTGATTT GCCCTCAAAT TGGCAGGGAA 360 

NTAACGTCTG CACAGAACTA TGANAAGGCC ATTGCCCAGA TACAC 405 

(2) INFORMATION FOR SEQ ID NO : 1.68 * 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168 : 

CTCAAAAACA GCTTCTAACA GCTTGAACAA GGGGGCTAAA ATTGATTCCT TGATGTAGGG 60 

TTTGAAGTAA GATAATAGGT GTTTCATAAA TCCCTTCTAT TCATATTCTA GAAATGAAGA 120 

AAGTGGGAAG CCCCACTCTC TGTTTTATTT GTTTAAGTAA GGCAATAGAT AGCCATATCC 180 

TGCTTTTTCC ATCTCATCCT TGGCCACAAA GCGTAAAGAA GCAGAATTGA TACAGTAACG 240 

GAGGCCGCCT AACTCCCGCG GTCCATCTGT GAAAACATGA CCCAAGTGAG CACTGCCTGA 300 

ACGAGAACGA ACTTCAATTC GCTCCATTCC ATGGCTCAGA TCCTTGTAAT AATGAATCAA 360 

CTCTTTGGAA ATCGGACGGC TTAAAACTTG GCCAACCCAC AACCTGAAGC AAACTTATCC 420 

TTGGCAAAAA AGAGTTGGCT CACCTGTTCG TAATATCTAC ATAAATCCCC TCCTTCCAAA 480 
GGTTTTGGTC CATAGGCATT GGTAAATGGA CCTCCTGTTA CCAG 

(2) INFORMATION FOR SEQ ID NO: 169 : 



524 



U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

CTTCATCTGG TGGTTCTCCA GTTGTAAGTA ACGTAGTACC ATTCTCAAAA CCATACTCAC 
AAGTTCGTAT TAAGGGGGAA CCATATATCT TTAAAGGAAT GAAATTGCAA AAAGATATTG 
TTACAACAAA AGAATATAAC GAGGTTTTAA AAAATGGCAA AAAGAAAAAT TGGAATCCAA 
TAGCAAATAC CAAAAAGAAC TAGAAAAATA CATTAAATAA GGAATGGTAT TGATCTTGAT 
AAAATTTTTA AAATACTGTC ATTTTGAATA TAAAGGAGTT TGATATGGAG TGGATTAGAT 
TAATAGGAAT AGCAATCATT GTTGTGGGTT TTATTTTAAA ATTTGATACA ATTGCAACAG 
TAGTCTTAGC TGGTTTGGTT ACAGCTTTAG TTTCAGGTGT TTCTCTCGTT GAATTTTTGG 
AGATTTTGGG AAAAGAATTT AGCAATCAGC GAGTGCTCAC GATTTTTATG GTTACCTTGC 
CTCTTGTGGG GCTGTCAGAA ACCTTTGGAC TCAACAACGA TCAATCGATT TGATTCGAAA 
GATTAAAGGT CTGACAGTTG GAAACTTCTA TACAGTTTAT TCTTTATCGA GAG 

(2) INFORMATION FOR SEQ ID NO: 170: 



60 
120 
180 
240 
300 
360 
420 
480 
540 

593 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 456 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DMA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: j 



CCTGCTCCTT 


ATCTTTGCAG 


TAATTGGCGT 


TTTAGTGGCC 


TTGATAGCTC 


AATTTTACTC 


60 


AGCAAAGGCA 


GCAGTAGGTT 


TTGCTAAGGA 


ATTGACAAAC 


GATCTTTATC 


GTCATATTCT 


120 


TTCCTTGCCC 


AAGGACAGCA 


GAGACCGTCT 


GACAACTTCT 


AGTTTGGTTA 


CTCGCTTGAC 


180 


TTCGGATACC 


TACCAGATTC 


AGACTGGTAT 


CAATCAATTC 


CTGCGTCTCT 


TTTTACGAGC 


240 


GCCCATTATC 


GTTTTTGGTG 


CCATTTTTAT 


GGCTTATCGA 


ATCTCAGCTG 


AGTTGACTTT 


300 


CTGGTTCTTA GTCATGGTTG 


CCATTTTGAC 


CATTGTCATT 


GTTAGGGTTA 


TCTCGATTGG 


360 


TCAATCCTCT 


CTACAGTAGT 


CTCAGAAAGA 


AAACCGAACC 


ACTGGGTTCA 


NGAAACCCCC 


420 


CNCCATTGCA 


AGGATGCCGG 


GTTATTCCGT 


GCCTTT 






456 



(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 



CTTGTACAGT 


TGAATCGTTC 


AGATATTGAT 


TTTGAAAACA 


GAGAGTGTGT 


TGTCTTTGGT 


60 


AAAGGAAAGA 


AGGAGAGACC 


AGTATATTTT 


GACGCTCGGA 


CGAAAATTCA 


TTTAAGAAAT 


120 


TATCTTAACG 


ACAGAAAAAG 


ATAGTCACCC 


TGCTCTTTTT 


GTAACGCTAG 


TTGGAAAAGC 


180 


CCAGAGACTT 


GGAATTGCTG 


GTGTAGAGAT 


TCGCTTAAGA 


AAGTTAGGAG 


ACAAACTCGG 


240 


CATACAAAAG 


GTTCACCCAC 


ATAAGTTCAG 


AAGAACTTTA 


GCGACTAAGG 


CAATTGATAA 


300 


AGGTATGCCT 


ATCGAACAAG 


TCCAAAAACT 


GCTAGGTCAT 


AGCAAGATTG 


ACACAACCCT 


360 


GGCCTATGCC 


ATGGTCAATC 


AAAATAATGT 


CAAGCATTCA 


CACCAAAAAT 


TCATCTCTTA 


420 


AAAGCAAATC 


CCGATATTCC 


GAAGAGGTGG 


TTGCATGAAG 


AAAGTGAAAT 


TAGGTGAAGT 


480 


GGCTACTTTT 


ATCAATGGCT 


ATGCTTTTAA 


ACCTCAAGAT 


TGGTCCTCTG 


AAGGAAAGAG 


540 


ATTATCCGAA 


TTACCGAATC 


TGACT 








565 



(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 53? base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



i 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 



CTTTATACTT 


TATCATTTCT 


AACTTAATTA 


TAGTCTTATT 


TTTAATAAAG 


TTCAATTATT 


60 


TATATGTAAA 


ATTCCTCATC 


AGTAAAAAAT 


AACCGATCTC 


ATTTCTGAGA 


ATCGGTTTTC 


120 


TAAATAAATC 


AAACCGATCA 


TTTACATAAC 


ATAAATTATG 


TAAATGATCA 


TACTACAACA 


180 


ACAAATCTTT 


GACTTTTCCA 


ATTTCACTTT 


TTGGAATAAC 


CAGGTGAATC 


ATATCACCCA 


240 


GATACATTCT 


GGTTGAGCCG 


TTAACTGTTT 


GGCTCTTGCC 


ATTATGGACT 


TGAGTTGTGA 


300 


TGAGGACGTT 


GTGTGGTAAG 


TTGAGTTCAT 


GAACTTGTTT 


CCCAGCAATT 


TTATCAGAAA 


360 


CTGGTATTTC 


GATAAGTGTA ACTTCTCCTT 


CGCTAGATAC 


TTCTTCTGGG 


AAGCATTTTT 


420 


TCCAGCATGG 


CTTCATAGAC 


TGGCGTTCCT 


TTGAGCNAAT 


CCCTGATOAT 


TTTAGAAACC 


480 


AGAGTGACAG 


ACCAGTGGCT 


TAAGGTGCGA 


ATATCTCCTA 


CCATCTCAGT 


TACGAGAT 


538 



(2) INFORMATION FOR SEQ ID NO: 173: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 509 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



CGTGCTAACC ATGAGAGCAT ATATTTCCTC CAACTATTTC TATTTCAAAA AATCCTTACT 60 

ATATTGTATC ACAATCAGAC ACAAAAAGAA AAAGCAAATG ATAAACAAAT GCTTTTAANG 120 

TTTTAAAAAA AGCTTCGAAA GGTTCTTCTT TATTTTTTAA NGGGAGAGAT AACGTTGATA 180 

TCTAAATCGT GGTCAAAGCC GGCAATTTTC CTTTAGATGT GTATTGGTGA ATATCATAAT 240 

CTAAATCAGT TTTAGGACTG CTCTCCAAAA ATCCTGAGTC TGAGCCGTAG GACGGAATCC 300 

AAACAGAGGT AAACTTGCCT GTATCAATAC TGTGTTCTTC CATGAAGTAG ACACCAACGT 360 

AGATGCCGAT GTTTTTAGCA CCTAGTGATG CTAGTTTTGC TCGAAAGTTT TCGACACCTT 420 

CGTTCATATA GACATAGTTT TGTCTCCACG TCAGCCCATA GTAACTAGGC TGTTAGGAGA 480 

GCACATTGTA GAAAACTTCG GCAGCCTTT 509 



(2) INFORMATION FOR SEQ ID NO:174: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 



CTGTTCTGTT 


TGCGACGCTA 


CTTGGTTGTA 


GCATGGGGGT 


GGTTTTAGAT 


GGTCAGTAAG 


60 


TATCTTTTAT 


TAGCAGTTAT 


TTTCTCTGGC 


TTGGTGACTT GGATTCTCCG TATGATTTCC 


120 


TTCATCTTAG 


TCAAGTATAA 


GGGCTTGCCT 


GCAATCGTTG 


AGCGTTTTTT 


GAAGTTCTTG 


180 


CCCGTTTCCA 


TTATCTTTGC 


CTTGATTCTT 


TCAAGCGTAG 


TGACAGGTAA 


GGTTGGGAGC 


240 


CTTCCTCAAA 


TTAAATGGCT 


AGACTTCTTA 


GCCGTCTTTC 


CAACAGCTTG 


GGTAGCCTTT 


300 


CGCTACCGCA 


ATCTACTCGG 


AACAGTTCTC 


TTTGGAGTGG 


TCTTGATTGC 


CATCTTGCGT 


360 


TTGGTCTCTT 


AAATTACCCA 


CCAAAAAAAC 


TTATCACAGA 


GATAGATATC 


ATATAATGGC 


420 


GTAAATGCTC 


CTTTTCTGTT 


AAGATTATAA 


GGTATTCTAT 


TTTGGAGGAA 


ATGACATGAA 


480 


AAAAATCGTT 


AAATACTCAT 


CTCTTGCTGC 


CCTAGGACTT 


GTTGCTGCAG 


TGTGCTTGCG 


540 


GCTTGCTCAG 


GGGTGCTCAG 


AAAGAAGAGA 


AC 






' 572 



(2) INFORMATION FOR SEQ ID NO: 175: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 



CTAGGTTATT 


TAGTAAAAGA 


CTAGAAGATT 


TTCTTTTCCC 


AATCGTCTTC 


TGTACGGCGA 


60 


GGATAGAAAA 


ACTCTGACTT 


GTCGGGAGCT 


TCCATCATTT 


CCATCAATGG 


TAGCATATCA 


120 


TAGGCCAGAT 


TTAAGTTTGG 


AATCTGGTCT 


TTTTGCACCC 


AAGAAACTTC 


TCCCTCTTCT 


180 


GAAGATTGAA GGGTACCAGA 


GAACTCAGTC 


GCCTTATAAC 


AAATGACAAT 


ATAGCGCCCA 


240 


CCTGTATCTA 


GTGGCCAATT 


TTTAATGCCG 


ACAAGTTGAG 


GATTTTGGAT 


AGTCAACCCT 


300 


GTTTCTTCGT 


AGATTTCACG 


AATGACAGAC 


TCCGCAAAAG 


CCTCATCATT 


TTCTACATGA 


360 


CCTCCAGGAA 


AGGCATAACC 


AGACCAGCGA 


TTGTTTTCAG 


GGGCGCGATA 


CTGCATCACC 


420 


ACGCGCTGAG 


TTTCGANGTC 


TTCCATCAGA 


CAGATATTTG 


TTTAAATTGT 


TTAATTGGGA 


480 


ACGGGACATA 


AATTTAC 










497 



(2) INFORMATION FOR SEQ ID NO: 1^6: 

(i) SEQTiENCE CHARACTERISTICS : 

(A) LENGTH: 425 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : Genomic 



DNA 



{xi) SEQUENCE DESCRIPTION: 



SEQ ID NO: 176: 



CTGAAATTTT GGAAGAAGAA ATCGGTGTAG 



ATACAGTCCA AAAAATAGGA CGCATCTTGA 



60 



TTTTGTTTAA ACAATCTAGC AAGAAAGAAA ATCGCAAGAT TTCTAAGAAA GTCAAAGAAA 120 

TCTAAGATCG AAACTCCAAA TAACTGTTTT TATAGAGAAA TAAAGGGGAC TAGCCTATGA 180 

CAATCGAACT ATTGACTCCC TTTACCAAGG TAGAGTTGGA GCCAGAAATC AAGGAGAAAA 240 

AACGCAAACA AGTTGGGATT TTAGGGGGGA ATTTTAACCC TGTTCACAAT GCCCATCTCA 300 

TTGTTGCGGA TCAAGTACGG CAACAGTTGG GACTGGATCA AGTTCTGCTC ATGCCTGAAT 360 

ACCAACCTCC TCACGTTTGA TAAAAAGGAA ACCATCCCTG AACACCATCG TCTCAAGATG 420 
CTTGA 

(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

CTGCTATCAA GACAGTATTA CCGCTGAAAG AAACAGCTTC ACTAGAAGGC GCAATTGAAA 6( 

AATAAGACGG GATCACGAAC TCCCGTTTTT CTATAAAAGA AAGGAAATGG GATGAAAAAA 12< 

TTAGTCTTTG TCTGTCTGGG AAATATTTGC CGTAGCCCTA TGGCCGAGTT TGTTATGAAA 181 

TCAATGACAG ATAACTACGA AATCCAAAGT CGAGCAACTT CCTCTTGGGA ACATGGCAAT 241 

CCGATTCATA AGGGGACTCA GGGAATTTTT CAAGAGTATG AGATTCCTTA TGACAAGAAC 301 

AAGACATCGC TTCAGATTAG TAAGGAAGAT TTTGAAGCCT TTGATTATAT TATCGGAATG 36 

GACGCTTCAA ATGTTCCGAC TTACGTCAGA TGTGTCCAGT AGACTGTCAA GATAAGAT 41 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 367 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



160 



WO 97/37026 



PCT/US97/05306 



(ii) MOLECULE TYPE: Genomic 



DNA 



(Xi) SEQUENCE DESCRIPTION: 



SEQ ID NO: 178: 



ACTGTTTGAA AAATATGATT CGTGGTCGAG 



AAATGAATTG CATTTAAGCA ATGTAGTTCA 



60 



GTATATAGAT TTGGAAATTA ATGATTTAAC AAAATAAAGG AGAAAAAACA TGGTTAAATA 120 

CGGTGTTGTT GGAGCAGGGT ATTTTGGAGC TGAATTGGCT CGCTATATGC AAAAGAATGA 180 

TGGAGCAGAN ATTACTCTTC TCTATGATCC AGATAATGCA GAGGCGATTG CAGAAGAATT 240 

GGGAGCAAAA GTTAGCAAGT TCCTTAGATG AGTTGGTTTC TAGCGATGAA GTTAGATTGT 300 

GTTATCGTCG CAACTCCCAA ATAATCTTCA TAAGGAACCG GTTATTAAGG CTGCACAGCA 360 

TGGTAAAAAT GTTTTCCTGT TGAAAAA 387 

(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 



CTTATGAAAT 


CAATCATCAA 


AATAATATTG 


ATCAGGACTA 


TTTAGGTAAA 


TTATCTACAA 


60 


CGATTAAATT 


GGTAGCAGAA 


AAGGAAAATG 


CCGTTGAGAT 


CCTAGAACAC 


TTGAATGTTG 


120 


TCCCTGTGTT 


GACAGCCCAT 


CCAACACAAG 


TGCAACGCAA 


AAGTATGTTG 


GATTTAACAA 


180 


ATCATATTCA 


TAGTCTTTTG 


CGTAAATACC 


GTGATGTTAA 


GTTGGGGTTA 


ATCAATAAAG 


240 


ATAAATGGCA 


CAATGATTTG 


CGTCGTTACA 


TCGAAATTAT 


CATGCAGACA 


GACATGATTC 


300 


GTGAGAAAAA 


ATTAAAAGTG 


ACTAACGAAA 


TCACGAATGC 


TATGGAAATA 


TTACAACAGC 


360 


TCCTTTTTGA 


AAGCTGTCCC 


TCATTTGACG 


ACGGAGTATA 


AGCGCTTAGC 


GCAAGCGCAT 


420 


GGTCTGAATT 


TAAAACAGGC 


TAAACCAATC 


ACCATGGGTA 


TGTGGATAGG 


TGGTGACCGT 


480 


GAAGGAAATC 


CATTTGTTAC 


AGCAAAGAAC 


TTGAAGCAGT 


CTGCACTCAC 


TCAGTGTGAA 


540 


GTCNTCCTGA 


ACTTACTATG 


A 








561 



(2) INFORMATION FOR SEQ ID NO:180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

CGACCTAATC ATATTGATAC TGCATTTGCA TCAGCAGATT TGTTTGAGTA TAAATTACAA 60 

TTAGCAGGAC AGACTTGGGG ATATTTAGAA TTTGAAACAA ATACAGAAAA ATATGGGAAA 120 

GTATTGTTAA TTATAAAGGG TAAGAAGCGA CTTACGAACC AATTTCCTTT GGTACAAAAA 180 

AATAAGAGTG GCTACTTATT TGAATATGCT CAGATGAATA CACTTTATCT TAATCAACAT 240 

TCTTCCTACA AAAATGATGA AGATAGTCAT TCCTTTCCAA TTCAGATGGA GTTAGTTTCT 300 

GATGAAATGA TTCCAAGAAA TTGAACAAGC TACTAAAAAT TCCGAATATC CGAAAAATTT 360 

ATGATTTTAA CTTATGANGC GGACTCCCGA AAAACAATAT TATATCTGTT AGATGTTTGT 420 

TATGCCCTGA TGCCCGAACT GGTCCAGTTA CACTTGATTC CAGATTGTCC TGAGTATATC 480 

CATCCAGTTC CGTACC 496 

(2) INFORMATION FOR SEQ ID NO:lBl: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 504 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 



CCACCCTCTA GCAAGTGGTG 


CTAATTCCTA 


TAGCCTAGAA 


GAAAACGAAA 


TCATCTACCA 


60 


AAAGTTAAAA AACTTGCTTA 


AAAACTGATA 


AAATACTTGC 


CAAACTTTTC 


AGAATCTGAT 


120 


AGACTAGTAT GGTAACAATC 


TATGGCTCGC 


AAAGAGACCA 


TGGCAGAAAG 


GAAATATTGC 


180 


AAAATGAAAA AAGATATCCA 


TCCAGAATAT 


CGCCCAGTTG 


TCTTCATGGA 


CACAACTACT 


240 


GGTTACCAAT TCCTTAGCGG 


TTCAACAAAA 


CGCTCTAACG 


AAACAGTTGA 


GTTCGAAGGC 


300 


GAAACTTACC CATTGATCCG 


TGTGGAAATT 


TCATCAGACT 


CACACCCATT 


CTACACTGGA 


360 


CGTCAAAAGT TCACTCAAGC 


AGATGGACGC 


GTGGATCGTT 


TCAACAAAAA 


ATACGGTCTC 


420 


AAATAATGAT AAGAGAACAG 


TTTCGGCTGT 


TCTTTTTTGT 


TTCTTGAAAT 


CAACTGCTGT 


480 


TTTCATGTTC CAGACTCATC 


TGTA 








504 



(2) INFORMATION FOR SEQ ID NO: 182: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:182: 

CTCAGGCGAT TACTAATTAC ATGACTTCTG CCTCAAACTT TAATGTCGAT GAGGCTAGCC 60 

AATTCATTCA ACAATTTACA ATTACAAAAC AAATCGAACA AGTAGAAAAA CTATTAGAGG 120 

AGTAGCATGG AAACTGCATT AATTAGTGTG ATTGTGCCAG TCTATAATGT GGCGCAGTAC 180 

CTAGAAAAAT CGATAGCTTC CATTCAGAAG CAGACCTATC AAAATCTGGA AATTATTCTT 240 

GTTGATGATG GTGCAACAGA TGAAAGTGGT CGCTTGTGTG ATTCAATCGC TGAACAAGAT 300 

GACAGGGTGT CAGTGCTTCA TAAAAAGAAC GAAGGATTGT CGCAAGCACG AAATGATGGG 360 

ATGAAGCCAG GCTCACGGGG ATTATCTGAT TTTTATTGAC TCAGATGATT ATATCCATCC 420 
AGAAATGATT CAGAGCTTAT ATGAGCAATT AGTTCAAGAA GATGCCGATG 



(2) INFORMATION FOR SEQ ID NO: 183: 



470 



<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 183 : 

CTTATACTTG CTTTTTTCCT TTTGAAAAAT GTTAATAAAT CGTTGGTAGT CTTCTCTAGG 60 

CATGGACAGA TCAATATCGT CGTCCCAAGG GATAAAGCCC TCATGTCGAA CCGCCCCAAT 120 

CAGAGTACCG TAGTTAATAA TATAGTTGAT ATTGTGCTTT TTACAGAGAG TATCAATATA 180 

ATCCAAAATT TCTAATTCAA TTTGTTTGGC ATCTTCAATG GTTAGTTGTT TCATTTTAAA 240 

CTCCTATGAT TTTTTGAATT TATTTTTTAA GGCTAGGACA TGGTTTAAAA ATTCATAGAA 300 

AATGCTATCT TTTGTGAAGA CAAGTAGACT AATATAAGAG ATAGCTGATA ACAAGACAAT 360 

CAAACCAGTA TTAATCAAAA ATGGCAAATT AATGACCATA TCCACAGGAT ACACGAAATT 420 

AATCAGGAAA TACATTGCTA CAAAGGAAAG TGAAAAGAGA GAGTATCGAA CAGTATAGCT 480 

AAAGATATGT CCCAAGTGGA TGAGTTGTTT CTATGGATGA AATGAT 526 

<2) INFORMATION FOR SEQ ID NO: 184: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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ACCATATCAA TACTATCGAA AAGTACAAGG GATNCAGTCT CAAGGTCGCT GAGGAAGATT 60 

TGAATGACCT AGACGATGGT GAATTTTACT ATCACGAGAT TATCGGTTTG GAAGTCTATG 120 

AGGGTGATAG CTTGGTTGGA ACCATCAAGG AAAATCCTGC AACCAGGTGC TAATGATGTC 180 

TGGGTGGTCA AACGAAAAGG CAAACGTGAT TTGCTTTTAC CTTATATCCC ACCAGTGGTT 240 

CTCAATGTTG ATATTCCAAA TAAACGGGTC GATGTGGAAA TCTTAGAAGG GTTAGACGAT 300 

GAAGATTGAT ATTTTAACCC TCTTTCCAGA GATGTTTTCT CCACTGGAGC ACTCAATCGT 360 

TGGAAAGGCT CGAGAAAAAG GGCTCTTTGG ATATCCCAGT TNTCATAATT TTTCGAANAA 420 

AATGCTGAAA AGG 433 



(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:185: 

CTGCAACTTG AATTTCTCCT TCTTGTTTCA AAGCAAGATA AACAATTCGA GCCCCTCTTT 60 

TTTCTAGCAA ATCCCCCATC TGGACAGATT GCATAAAGGA ACGAGAAGAA ACCTGATCAG 120 

AATAAGTCTG AAACTCTTCT TTCGTGAGTG TTGTTAGTGC CATATACTTA CTTTCTATGT 180 

TTTTTTCTTA ATGTTTTACG GAAATCAAGA GCAAGTCTTA ACAGAGGATA GAGAGGATGA 240 

GTGGGCATTG TAAATTCACC CAAGTATTCT TCAATCGTTG GATTAAATTT TTCCTTAAAA 300 

TGATAAAGTC CACCATTGAG AGAGTTTTCA ACACCACCTA AATTTTGCCA GACCATACCT 360 

CGCTCAAAGG CATAGCGAGC CGTTTCATAC CATGTTAAAA TTGGTGCATT GTAACGTTTA 420 

AAATCATCAT CCATACCAGC ATATATATTG ACAGAGGTAG TACCAAATTC CAAACTCAAA 480 

GTACCGCTAA AGGAATCTCG CTTGACCTAC ATCTATATAT TCCTGCAAGA AGGTCAATTC 540 



(2) INFORMATION FOR SEQ ID NO: 186: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

<ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:186: 
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CTCAGGCTAA AAGAGTCCAC TGGACTCTTT TACTCCGTCC CATAACCAAT GACTTAATGG 
CGCGTTCAGC TAGATTATTG GAAAGGACCA GATGTCCGTC TTTCAAAATA GTCTTAAAGG 
TTTCTTCATA CTTGAGGCTG TATTCAATTG CCCTTCCTAG TTTTGAACCC GATAAAACTG 
ACTGACGCCG GCACCAAGCA AAGAAATCTT CCATTAGGGG TTGGAGCTCT TCTTGACGTT 
TCTGTAGTCG TTCATCAGCT GACAATGTCT CCCAGTCTCT TTCCAAGGCA AATAACTGAT 
CACAATAGGC TAATCCTTTA G 

(2) INFORMATION FOR SEQ ID NO: 187 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 

CTTGACCAAA CAAGGCAGTT GGAAATTGAT TTTGGAGGAT GAGAGCGCTG GAAGGGAACT 

CTTCCCAATC TTGACTCAAG GTCNATATAT CGCAACATTT GACCAACAAG CGCCANCTAT 1 

CGATGAAATC TTTAAACTAT AATCAGGAGT GGAAGTATGA GAAATATGTG GGTTGTAATC 1 

AAGGAAACCT ATCTTCNACA TGTCGAGTCA TGGAGTTTCT TCTTTATGGT GATTTCGCCG 2 

TTCCTCTTTT TAGGAATCTC TGTAGGAATT GGGCATCTCC AAGGTTCTTC TATGACTAAA 3 

AATAATAAAG TGGCANTACT GACAACAGTG CCATCTGTAN CATAAGGACT GAAGAATCTA 3 

AATGGTGTTA ACTTCGACTA TAAAGATGAA GCAAGTGCCA CCAGAAGCAA TTAAA 4 

(2) INFORMATION FOR SEQ ID NO:188: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 246 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

CTATNAAATA GAGTGGTTCT CTATCTGCCA TGACCACAAA TNAGGACAAT GATACTTCTG 
AACGTTCAGA CTGCTATCGT AAAAGGACAG CGGGTGAGAC GCCCATGAAT GATCTAACCA 
GTCATACCCA CGGAGGAAAT TATACAATAG CACGTTATCA GGAGGAAAAA TTTTGGAACA 
AACAGTTGTA AAATATAGTC TGCAGTTATC AATGCTTAAT TTTTTACAGG CGAAGATAAT 
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CGAACG 246 



(2) INFORMATION FOR SEQ ID NO: 189: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 



CTTGTCATCC 


TCACGAAAAT 


TGCGATTGTA 


AAGCACCAAG 


CCTTGACTCG 


TGATAGACTG 


60 


AATCATGCTT 


CTCTCATGTA 


CTCCTCAAGT 


CGTTTCATGG 


CTTCTTTGAT 


AGTCTCCATG 


120 


CTGGCTGCAT 


AAGATAGGCG 


GACGTAGCCT 


TCCCCGTAAC 


GTCCAAAGGC 


TGCACCAGGG 


180 


ATAAAGGCAA 


CGGCCTTCTT 


CTGAGCAAAA 


TCCTTCAGAA 


AAGCAAAGGA 


GTCTTGATTG 


240 


TAGCCCGCTG 


GAATTTTAGC 


AAAAATATAG 


AAGGCACCGT 


CTGGTTTGAT 


AATCTCAAAA 


300 


CCAAGAGCAG 


TCATTTTTTC 


GATGATATAG 


TCCCGACGTT 


GGATATATTC 


CTTCTTCATG 


360 


GGCTCCGCAT 


CGTTTTTACC 


AGCCGTCAAG 


GCTTCTACCG 


CAGCATGTTG 


CGCCATGGTA 


420 


TTTGCGGCAG 


TGACCAAGTT 


ACTGGTGACT 


CTTGATTAAC 


TGGGCTGTGA 


AGGTCNCANG 


480 


AGCAAAAATC 


AGCCCCAAAC 


GCCAACCTGT 


CATGGCATG 
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(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 



CTTCAGCTAT 


TCGTGGGATA 


TTAGATGTAC 


TTATTTAGGG 


GTTGAAATCA 


TATGAATATT 


60 


ACCAATTTGT 


TTTCTATCAA 


GACAGGATGT 


GATGAAACTG 


ATAGGCAACT 


GCAAAAACTA 


120 


TTTTTTCAGT 


TGGATTTACA 


ATTGGGAGAA 


TTGACAGATC 


AACTAAGAAA 


ATTAGATTCT 


180 


AATTTTGTTC 


CTCGTAGTCA 


ATTTGTAGAC 


ACGTTGGATT 


TGAATGATGT 


AGAATATAAA 


240 


GAAATTTTAA ACTATTTTAT CTTCCATCGT 


AATGATAGTG 


AAGAAAGTTT 


GGTAGAATGG 


300 


TTATATGATT 


GGATTTCCAC 


AAATCGTTAT 


GAACTTCCCT 


AAAGAGTTTT 


CCGATTCCGT 


360 


ATGGCTCCAT 


AAATACCCAT 


GAAAGTGTTA 


CTGAAGTTTT 


CCGGAGAATG 


AATAACTAAA 


420 


AAACAGTCAT 


TAGTGACTGT 


TTTTTATAGA 


AAAGAGTTTT 


TATATGTT 




468 
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(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 



CTTGAATAGT 


CTCATCACGA 


AAATTGCCAT 


AACTGATAAA 


ACCAACCACC 


TTCACGCCAT 


60 


CCATCGCAAT 


CAATGTATTT 


TCTGGATACT 


TTTGACTAAA 


GAATCGACAT 


CTTTCTAATG 


120 


TCATTGTATC 


CTGAAATTCC 


GCAGGCAAAA 


GGTCATCATA AGACTCTCTC 


CACGTTTGCC 


180 


AGTGAACGAG 


GGATTTGCCT 


TCTATCTCTT 


CAGGAGTTNC 


CATAGATTTG 


ATAATAACCG 


240 


TCATTTATTT 


TCTCCCAGTC 


TNCTCTCAAA 


ATACCATATT 


TAATACTATC 


AAAATATTTA 


300 


CCTTGATAAT 


AACGAACTTT 


NGGAATATGA 


GCTNCTTTTT 


TCATNCTTAA 


TTTTTCAGCA 


360 


AGTTTCATCA 


TACCAAGATT 


TCCTGACCAA 


GTTGTCAAAA 


CCAGATGCTC 


CAACTCCAAG 


420 


TAATCCTGAA 


ACGTCCTATC 


TATCCACTGC 


AACATAGCAA 


CTTTCCCAAT 


ACCAGTGTTC 


480 


CAGAATTTTT 


TATCATAAAT 


ACCAATTCCC 


AATCCATCCA 


TCTTGTTTCC 


TTTACATACC 


540 


CAATA 












545 



(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

CTTGATTGTG GAAGAAGTCC GTGATCGCTA TGTGGGCAAA GTCGATGCCG TTTTTCATAA 60 

CGGCGATTCT GAACTACGTC CGGATTCTCC ACTTTGGGAG GGCATCCGCG TTGTTAAAGG 120 

GAACATGGAC TTCTACGCCG GCTACCCAGA ACGTCTGGTG ACTGAGCTTG GTTCGACCAA 180 

GATTATCCAA ACTCATGGTC ACTTGTTTGA CATCAATTTC AACTTTCAAA AGTTGGACTA 240 

CTGGGCTCAG GAGGAAGAGG CCGCTATCTG CCTCTATGGT CACTTGCATG TGCCAAGTGC 300 

TTGGTTGGAA GGCAAGATCC TCTTTCTAAA TCCAGGCTCT ATCAGTCAAC CACGAGGTAC 360 

CATCAGAGAA TGTCTCTATG CTCGTGTGGA SATTGATGAT AGTTACTTCA AAGTGGACTT 420 

TTTGACACGA GATCACGAAG TGTATCCAGG TTGTCCAAGG AGTTTAGCCG ATGATGCCAA 480 
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(2) INFORMATION FOR SEQ ID NO: 193: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 



CGTAACCATT 


AACTATGAGG 


TGGTTTGCCT 


CCTCAGCGAC 


CGTATTCCGA 


GAGAATATTA 


60 


TTAGAAAAGA 


AAGGAGTGGA 


GCATGAATCT 


ACATCAACCC 


TTGCATGTCT 


TGCCTGGTGT 


120 


GGGACCAAAG 


TCAGCAGAAA 


AATACGCCAA 


ACTAGGAATT 


GAAAACTTGC 


AAGATCTCTT 


180 


GCTCTACTTT 


CCTTTCCGTT 


ATGAAGACTT 


CAAAACCAAG 


CAGGTGCTGG 


AACTAGAAGA 


240 


CGGTGAGAAG 


GCAGTTCTTT 


CTGGTCAGGT 


AGTGACTCCT 


GCTAGTGTCC 


AGTATTATGG 


300 


TTTCCAAGCC 


GCCAATCCGC 


CTGCGTTTTT 


AGTCCTCCAA 


GCCAGGGAAG 


AAGGTTCGTT 


360 


TTTTTTTGGC 


GGGTGAAATT 


TCCTTTTTAA 


CCCAGCCCCC 


TATCCTTGGG 


CTTGAATAAA 


420 


AAATTAGAAA 


TTTTGGGGAA 


CCAACCCCTT 


TGGCTTGT 
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(2) INFORMATION FOR SEQ ID NO: 194: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION ■ SEQ ID NO: 194: 

CCTTACTAGT TTACTCAATC TTTACATCGT TCTTTCGGCA GTTCAAACTC AGAATCCACA 60 

GCACAGGTTA TGCAATTACT GTCTGAAAAT ATGTTAAAAA CCATTCAGTC ACTTTCGGTC 120 

TGGCAGATTT ATTTGCTTGG TTTTGAGCGA ATCTTGGCGC TTGGTTTCCA ATTACTTTTG 18 

ACAGTTTGGG TTTACCAAGC TGTTCGCCAG AAGAAATGGA TTTATCTCCT AGCAGCCTAT 24 

GGCTTGCATG CCTTCTTTGA TCTGGCACCA TCTCTTTTCC AAGTAGGCTG GTTGACAAAT 30 

CCAGTCTTGG TTGAAGTGAT TCTAGCACTG CAGCTCGTTC TGGTCGCCTA TGGAACCAAG 3 6 

GAAATCTTTT GTAAAAAATC ATAAAAAGGG GGGAACCTCT TTTTCTTATG CAAAATCCAA 42 

AC AAGGTA1 T TTTATGGTCG TCAAATGTCT C7 AA AA ATGG TAT ?\ATGG AA TGAATTTTGT 48 

AAAAGGAAGA ATGACATGTC TGTAAGAGAA AAAATGCTTG AAATCTTAGA AGGAATTGAT 54 
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ATCCGTTTTA AGGAACCCTT GCATAG 

(2) INFORMATION FOR SEQ ID NO: 195: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO:195: 



CTTTGACAAA 


GAGTATGACA 


CCTGACCGTG 


AAGTCATTAC 


CTTTATTCCT 


GAAAAATTTA 


60 


TTGTGGATGG 


TTTCCAAGGG 


ATTCGTGACC 


CACGTGGCAT 


GATGGGGGTT 


CGCCTTGAAA 


120 


TGCGTGGTTT 


GCTTTATACA GGACCTCGTA 


CTATCTTGCA 


CAATTTGCGT 


AAGACGGTTG 


180 


AGCGTGCAGG 


TGTTCAGGTT 


GAAAATGTTA 


TCATTTCACC 


ACTAGCAATG 


GTTCAGTCTG 


240 


TTTTGAACGA 


AGGGGAACGT 


GAATTTGGTG 


CTACAGTGAT 


TGATATGGGG 


GCAGGTCAAA 


300 


CGACTGTCGC 


TACAATCCGT 


AATCAAGAAC 


TCCAGTTCAC 


ACATATTCTC 


CAAGAAGTGG 


360 


AGATTATGTA 


ACTAAAGATA 


TCTCCAAGGT 


TTTGAAAACC 


TCTCGCAAAT 


TAGCGGAAGG 


420 


CTTGAAACTG 


AATTACGGGG 


AACCTATCCG 


CCTCTTGCAA 


GCAAAAAACT 


CCATTAAAGT 


480 


TATTGGAGAA 


TTAAACCAGT 


CAAAGTGACG 


GAACCTACTT 


GTC 




523 



(2) INFORMATION FOR SEQ ID NO: 196: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 



CTGTGATTTC 


AGAGAAGAAA 


TCAAGTGCTG 


TAACAGAAGT 


AAGATGTAAT 


TGTATGTAAA 


60 


GGAGACGTCA 


TGTTAAATAG 


TATTGTAACC 


ATTATTTGTA 


TTGCCCTTAT 


CGCGTTTATC 


120 


TTGTTTTGGT 


TTTTCAAAAA GCCTGAAAAA 


TCTGGACAAA 


AAGCCCAGCA 


AAAAAACGGA 


180 


TACCAAGAGA 


TTCGAGTGGA 


AGTCATGGGA 


GGCTATACTC 


CTGAGTTGAT 


TGTCCTCAAG 


240 


AAATCAGTGC 


CAGCCCGCAT 


TGTCTTTGAC 


CGCAAGGATC 


CTTCACCATG 


TCTGGATCAA 


300 


ATTGTTTTTC 


CAGATTTTGG 


TGTACATGCG 


AACCTGCCAA 


TGGGGGAAGA 


GTATGTAGTG 


360 


GAAATCACGC 


CTGAACAGGC 


TGGAGAGTTT 


GGCTTTGCTT 


GTGGTATGAA 


CATGATGCAC 


420 


GGCAAGATGA 


TTGTAGAGTA 


GGTGGAGACT 


ATGACAGAAA 


TTGTGAAAGC 


AAGCTTAGAA 


480 
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AATGGCATTC AAAAAATCCG TATCCGAGCT GAAAAAGGCT ATCATCCACC CATATCCA 538 



(2) INFORMATION FOR SEQ ID NO:197: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51S base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 



CCTCAAGAAG 


TCCGCGATAC 


TTACGAAAAT 


ATTTTGTAGA 


GAATCGAACC 


GCAAGGTTCG 


60 


GTTTTCTTTC 


TCTTTTTGTC 


TATAATTTGG 


TATAATAAAC 


AGTATGAAAA 


TCGTATCAGG 


120 


AATCTATGGG 


GGACGTCCCC 


TCAAGACACT 


AGAAGGCAAG 


ACAACAAGAC 


CTACTTCGGA 


180 


TAAGGTTAGG 


GGAGCCATTT 


TTAACATGAT 


TGGTCCCTAC 


TTTGAAGTGG 


GACNAGTCTT 


240 


GGACCTTTAT 


GCAGGTAGTG 


GTGGTTTATC 


TATCGAAGCA 


GTATCGCGTG 


GCATGTCCAG 


300 


TGCTGTTTTG 


GTGGAGCGAG 


ACCGTAAGGC 


TCAGACCATC 


GTGGCTGAAA 


ATATCCAGAT 


360 


GACCAAGGAA 


GTTGGAAAAT 


TTCAACTCCT 


CCAAAATGGA 


TGCAGAAAGG 


GCATTGGAAC 


420 


AGGTTNTCTG 


GGGAATTTGA 


CCTCGTTTTC 


TTAGACCCTC 


CCTATGCCAA 


GGAACAAATC 


480 


GTAGCAGATA 


TTGAAAAAAT 


GGCTGAGAGA 


GAGCT 






515 



(2) INFORMATION FOR SEQ ID NO: 198: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 



CTGGANACAA ACTTTACAGA TCAAGTCGAT ACGATGATTT ATGTTGATAA AGAAGAAAAA 
GAAACTATTA AAGCTGCACT TGTGGAGTTT TTTAATGGAA AAGTCACTTT AACTGACCAA 
GGTTTACGAG AGGTTGAAGT TCCTGTAAAC TTAGTGTAAA CAATGAATAA TACAGCGTTT 
CGTTGACATT CTCACAACTA CTTTAGCGAG CAAAATAAAA AGATGCGTAC CAAAATATAC 
TAGAAAATGA AGCAATTCAA ACGAAACCTG ATATCGTTTT CCTTCACACC TATTTACTAG 
AATTAACTGA ACGCAATCAC TTGAAAATTA ATGACTTTGG AT 



{2; INFORMATION FOR SEQ ID NO: 199: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 



CTATTAGCGA 


CTTTCTCTGA 


AATATGGTAT 


GATAAAGGAT 


ATACAAGGAG 


ATAAAATGAA 


60 


TAATAATTTA 


CTGGTATTAC 


AATCCAGACT 


TTGGTCTGGT 


TGATGGTGCG 


GTATCGGCTA 


120 


TGATTGGAGT 


GGCTTTAGAA 


GAGTCTCCAA 


CCTTAAAAAT 


CCATCACTTG 


ACGCACGATA 


180 


TCACGCCTTA 


TAATATTTTT 


GAGGGGAGCT 


ATCGTCTCTT 


TCAGACGGTG 


GATTACTGGC 


240 


CTGAGGGAAC 


GACGTTTGTA 


TCGGTTGTCG 


ATCCAGGTGT 


CGGTTCGAAA 


CGTAAGAGTG 


300 


TAGTTGCCAA 


GACTGCTCAA 


AAATCAATAC 


ATTGTCACGC 


CAGATAATGG 


GACGCTTTCC 


360 


TTTATCAAGA 


AACACGTTGG 


CNTTGTANCC 


ATTCGTGAGA 


TTTCTGAGGT 


GGCCAACT 


41B 



(2) INFORMATION FOR SEQ ID NO:200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 



CTCTCGCTTT 


TCTCATAGTG 


GGAGGTAAGG 


ATGGAATTAC 


GCAGACCAAG 


ATTAGCGGAT 


60 


AAGAAAGCTG 


TTTTAGATAT 


GATGACAGAG 


TTTGAAAAGA 


GCCAATCAGC 


CCATGATGGA 


120 


GGATTTTGGG 


ATACAGAGAA 


TTTTGTGTAT 


GAAGAGTGGT 


TGGAAAGCAA 


TCAGGAACAG 


180 


GAAATGGGGA 


TTAATCTGCC 


TGAAGGATGG 


GTTCCTGCAA 


TTCAGTTAGT 


GGCTTTTTCT 


240 


GAGAAAGGTC 


AAGCAGTTGG 


ATTTCTTAAT 


CTCCGGTTGC 


GCCTCAGTAA 


CTTTCTACTA 


300 


GAAGAAGGTG 


GGCACATTGG 


CTACTCCATC 


CGTCCATCTG 


AAAGAGGCAA 


GGGTTATGCA 


36C 


AAAGAACTCT 


CCGTCAGGGC 


TTGCAAGTTG 


CTAAGGAAAA 


GAACATCAAG 


AAAGCTCTGG 


420 


TGACCTGTAG 


CGTGAATAAT 


CCTGCTAGCA 


GAGCATCATT 


CTA 




463 



(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 475 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:201: 



CTCTTACGTG 


ACCAGAACAT 


GCTACTTCCC 


ATCCGAGAAG 


CCATTCTATC 


TGGCTTACCA 


60 


GTGTTTGGGA 


CCTGTGCGGG 


CTTAATTTTG 


CTGGCTAAGG 


AAATCACTTC 


TCACAAAGAG 


120 


AGTTATCTAG 


GAACTATGGA 


TATGGTGGTC 


GAGCGTAATG 


CTTATGGGCG 


CCAATTANGA 


180 


AGTTCTNCAC 


GGAAGCNNAA 


TGTAAGGGAG 


TTGGCNAGAT 


TCCNATGACC 


TTTATCCGTG 


240 


GTCCGATTAT 


CAGTACTGTT 


GGTGAGGGTG 


TANAAATTCT 


AGCAACAGTG 


AACNATCAAT 


300 


TTGTTGCNNC 


CCAAGANNAA 


AATATGTTGG 


TAAGTCCTTT 


TCATCCAGAA 


TTGACTGATN 


360 


ATNTGCGCTT 


GCNCCAGTAC 


TTTATCAGTA 


TGTGTAAAGA 


AAAAAGTNGA 


GATTGAATTT 


420 


CTCCAACTTT 


NCCACNTGTA 


ATNNNCAATN 


NCNATGTATT 


GGAGTACGGA 


CGCAG 


475 



(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 02: 



CCTTCAAAAA 


TTATTTGATG 


CAACCTTAGA 


CAATCCAGAA 


CTCACCGCAA 


CATTAGTGCC 


60 


TTTAGGAGAT 


GGTATTCTCA 


TGCTTCGTAA 


AAATGTAGCA 


GATGTTCAAC 


TGTCTGAAAG 


120 


CGAATGATTT 


TCAGAAAAAT 


TTAAGAAAAA 


ATAGTAAAAT 


AGATAGAGTA 


ACACTTATCT 


130 


CAAAGGAGTA 


GACATGAAGA 


AAAAATTATT 


GGCAGGTGCC 


ATCACACTAT 


TATCAGTAGC 


240 


AACTTTAGCA 


GCTTGTTCGA 


AAGGGTCAGA 


AGGAGCAGAC 


CTTATCAGCA 


TGAAAGGGGA 


300 


TGTCATCACA 


GAACATCAAT 


TTTATGAGCA 


AG TG AAAAAC 


AACCCTTCAG 


CCCAGCNAGT 


360 


CTTGTTAAAT 


ATGACCATCC 


AAAAAGTTTT 


TGAAAAAACA 


ATATGGCTCA 


GAGCTTGATG 


420 


ATAAAGAGGT 


TGATGATACT 


ATTGCCGAAG 


A 






451 



(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 203: 



(2) INFORMATION FOR SEQ ID NO: 204: 



60 



CTATGGATAA GGCACTTGCT GACCTCAAAA CATCAGGGCA CTTGCCTATT CCGCGACACC 

TACGTGATGG GCACTACANT GGAAGCAAGG AACTGGGGAA TGCCCAAGAC TATCTCTATC 120 

CACACAACTA TCCTGNAAAT TGGGTCAAGC AAGACTATCT GCCACAAAAA ATTCGTAATC 180 

ATCACTATTT CCAAGCAGAA TATACTGGTA AATATGAACG GGCTTTGGCT CAAAGAAAGG 240 

AAGCTATCGA CCATTTGCGA AAAATCTGAA ATCCTTTTCA AAAAATTGCA CTTTCCTCTT 300 

GATTTTTTTT GAAAAAGTGG TATCATATAA ATATAGAAAC GCTGTGGTGT ACGACTTCAC 360 

ACTTAAGTGT TGACCGACTA TTTTTTGTAT TATTANGGAA ACAAAAGTCT TCTGTCAGCA 420 
TGTAGGCCGT CTCACACGGA AACAGCTTCA GTT 



453 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 

CCGTAGTACA GGCACAAAAA GATTTGGAAA ATAGAAAAAG AAAAGCCAAG AAAAAGGCTC 
AGAAAACGAA ATAAATAAGG AGGAATCTGG TAATGGTAGT ATTTACAGGT TCAACTGTTG 
AAGAAGCAAT CCAGAAAGGA TTGAAAGAAT TAGATATTCC AAGAATGAAG GCTCATATCA 
AAGTCATTTC TAGGGAGAAA AAAGGCTTTC TTGGTCTATT TGGTAAAAAA CCAGCCCAAG 
TGGATATTGA AGCGATTAGT GAAACGACTG TTGTCAAAGC AAATCAACAG GTAGTAAAAG 
GCGTTCCGAA AAAAATCAAT GATTTGAACG AGCCTGTGAA GACGGTTAGT GTAAGAAACC 
GTTGACCTTG GTCATGTGGT TGATGCTATT AAAAAAATAG AGGAAGAAGG TCAAGGTATT 
TCTGATGAAG TC 

(2) INFORMATION FOR SEQ ID NO: 205: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 428 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: lines r 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:205: 



AAAATCAAGA 


AATTCTTTGA 


AGGAACGACA 


GATACAGCTG 


AGAACTATAT 


CAAGTCGGCC 


60 


CTTAAAATGT 


TGGTCAAATA 


GGAGCAGAGA 


ATGACAAAAC 


GTTGTTCGTG 


GGTCAAGATG 


120 


ACCAACCCGC 


TCTACATCGC 


CTATCATGAT 


GAGGAGTGGG 


GTCAGCCCCT 


CCATGATGAC 


180 


CAAGCATTGT 


TTGAGTTGTT GTGTATGGAA 


ACCTATCAGG 


CAGGCCTGTC 


TTGGGAAACG 


240 


GTACTCAACA 


AACGCCAAGC 


TTTCCGAGAA 


GCCTTTCCAT 


AGCTATCAAA 


TTCACTCCAG 


300 


TCGCAGAGAT 


GACTGACACT 


GAATTGGAAG 


CCATGCCTGG 


AGAATCCCAG 


CTATCATTCC 


360 


GAAATAGAGC 


CCAAGATTTT 


TGCTACACGC 


CGCCTAACGC 


CCCAACCCTT 


TCCTACAGTT 


420 


ACAGGCAG 












428 



(2) INFORMATION FOR SEQ ID NO: 206: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:206: 



CTATTAGATG 


GTATTGACTG 


CCCAGACACT 


TACGGATCTA 


GCTGAGACAG 


GGAATTGTCC 


60 


ATAACCTTCC 


TCATCAATTG 


TAACTTGACC 


TTGGTGGTTA 


CCAAGTAAAT 


CTACAAAGGT 


120 


TTGATTAGTC 


CATTCTTGAC 


CGACAAACAT 


TGACTTGCTG 


TTTTCTTGGT 


CATTTGAAAT 


180 


CAAGACTGCG 


ATTGGGGATT 


GATTTTCAGC 


ACCTGAACGT 


ACCCAACCGA 


TACAGTTAGC 


240 


ATGGTCAAAG 


TAGTCATTTT 


GTTCTCCATA 


GGCCAAATCT 


TTTCGGATGG 


CTAGGAGGCG 


300 


GTCAAGGATT 


TCTTTGAAAT 


CTTGTTGAGC 


ATACTGGCCT 


GAAATCC C AT 


AGTAGTCTCC 


360 


GTAAAAGACA 


CATGGAAGGC 


CGTCTTGGCG 


TAACAAAATG 


AGGGCATAGG 


CTGCTGGCTT 


420 


GAACCATTCT 


TCAACGGTAG 


ACTCAAGGGC 


TTGTCCTCGT 


TGGGTATCGT 


GGTTGTCGAC 


480 


AAAATCACAG 


ACTTGTCAGG 


CTTGAGTTCA 


ACCAGGCT 
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(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 07: 



CTCTGTCCTT TTACCTGAGA GTTTGAGCAG TTGCCTGCCT TCCCCCTTCG GTGCCTTTAC 60 
GGTCTCTCCA GAGTTCCGTC CATTTACAGT CATGGAAAAT CAAACGATTC CCCACTTCTA 120 
TTAAACTTCA TTCGGTGTTG GTATTTAATT GATTCTAATT TTACAAAAAA TGTTGGCTTT 180 
TGTCAATGTG TTTATTAGTA AAAATTAGTT CAACAGTTTT TACTTTATAA AGTCCAGAAT 240 
ACTGCTATCC TTTAAAAGTG ACAATAGTCG CACCACTGCC TCCAGCATTT TGTGGGGCAT 300 
AGCCGAAACT CTTGACATGT TTGTTTCTTT GCAAGTATTT GGTAACTCCT TCACGGATGA 
CTCCTGTTCC GATACCATGG ATGATATCAA CTTGAGCCAT ATTGTTAAGC AAGGCTTGGT 
CGAATGAAGG TATCTAGCTC ATTCATGGCT TCTTCATAGC GCTTGCCTCG AAGATTCAGT 
CTAGCTTGAG TCCTCGCCCA GAAGTTCG 



360 
420 
480 
508 



(2) INFORMATION FOR SEQ ID NO:208: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 



CTCTTGGGCT CTCTTCTCAG CAACTTCCCT GCTTTCACCT TTGGTCTTGA TAGGTGANAA 
CGTAATATNT CCAATCNNNG TCATATTAGG AAANAGTTAA ATTGTTGGAA AACCATGCCC 
ATCTTCTCAC GCATGGGAAA CAGGTCATTC TTCTTGTCCG TAATATCGAC TCCCTCAAAG 
ATAACCTTCC CCTTGGTTGC TTCCTCCAAC AAATTCATAT AGCGAAGCAA GGTAGATTTC 
CCACTCCCTG AAGGACCGAT GATAACGACA ACTTCTCCTC TTTTAATCTC GAGGTTGATG 
CCCTTCAATA CTTCATTCTT TCCAAAAGAT TTATGTAATT TTCAATTTTT ATCAAGGTTT 
CTGTCATTAT TTCTTATC 

(2) INFORMATION FOR SEQ ID NO: 209: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 741 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION. SEQ ID NO: 2 09: 

175 



WO 97/37026 PCT/US97/05306 



CTGGCCAGCC 


AGATAAACCT 


CGATACGGCT 


TTCTGCTCGT 


TTGATGCGTT 


GCAAGAGTCC 


60 


GCCCATACGG 


ACGTCAACTG 


TATCCAAACC 


AAAGACCTTG 


TTTTATTTCA 


GCCAGTGGTG 


120 


APTAAAGAGG 


GCATGGAAGT 


CTTCAATTTG 


GCTTCTAAGT 


TCTGGTAATT 


CTTGTCTGGC 


180 


^ 21 TTTGTTGT 

UAj. J. 1U A. iUi 


AAACTTTCTT 


TATCATCCGC 


TTGGTAGGCA 


TGACGAATGC 


GTCGTCCCAC 


240 


ATCTACTTTG 


CTACTTAAAA 


TAGCATTCAA 


CTGGGCCTGA 


GTTTCAAAGA 


GATAAGCATA 


300 


111 1 AVjU 1 


An 


TGTPAGCAAT 


AGTTCCCGCC 


GCCTGAGCGA 


AGTGTGGCTT 


360 


GTCCTGTTCA 


GGTGTCATGT 


GTCGGTCAAG 


TATCGGACAG 


AGAACATCCT 


GATAAAAGAC 


420 


ATAGCGGTTG 


GGATTGATGC 


CACTGAGATT 


ACCTGGTAGG 


TCTGGTAAGA 


GGTTGGCAAG 


480 


ATCAATCTGC 


ATAAAATCCT 


CAACCGTTAG 


ACCAGTATTG 


GTCTTGAAAT 


GCGCAGACAA 


540 


ACTATCTAGG 


TCATTGCGAT 


AGATCAAGTT 


ATCGATACCG 


TCGACCTCGA 


GGGGGGCCGT 


600 


ACCCAGATTT 


GCAAGCTTGG 


TAGGATAAAG 


AACTGGGCAG 


TTTCACCACC 


ATTGTCTCCC 


660 


CAACCCGTTA 


CGATGACTTC 


TTTAATCTGA 


TTGGCACGGC 


AGGCTTTATT 


AGCCTCGATA 


720 


GCCACTAGAC 


GGCTAAAATG 


G 








741 



(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 



AAACTCGATT 


TGAGAGAAAG 


TCCAATAAGT 


CTTTCCATAA 


TAAAACGCAT 


AGTAGCAAGA 


60 


GTTTCTACAC 


CCACTATTAT 


GCGTTTTTAT 


CATTTTGAAA 


TCTACTGCTA 


ATCAATGAAG 


120 


AAGCAAAGAT 


CAAACTAGGC 


AGCTAGTTAC 


AAGTTACGTA 


AGCGCGTCAT 


AACAAGGTAT 


180 


CTATCCTAAT 


TCCCCAACTT 


CCAGTATACT 


AGATAAAAAA 


TAATTATCGG 


AGGAAAGGTA 


240 


TTGTACTATG 


ATAATTCAAC 


TAAGTGATTT 


AGGTCAAGTT 


CACCTTGTTT 


GTGGCAAGAC 


300 


AGATATGAGG 


CAGGGAATAG 


ACTCATTAGC 


CTATGTAGTT 


AAAACCCACT 


TTGAATTGGA 


360 


TCCTTTCTCC 


GGTCAAGCCT 


TTCTCTTTTG 


TGGTGGACGT 


AAAGACCGCT 


TTAAAGCCCT 


420 


TTACTGGGAT 


GGTCAAGGAT 


TTTGGCTACT 


ATATAAACGC 


TTTGAGAACG 


GAAAACTGAC 


480 


TTGACCAAGT 


ACAGAAAAGG 


ATGTCAAAGC 


TCTCACACCA 


GAACAAGTAG 


ACTGGCTTAT 


540 


GAAGGGCTTT 


TCTATCACTC 


CCCAAATATA 


GTGGATTTAA 


ACTAGAATAG 


TACACCCCTA 


600 


CTTCTAAAAC 


ATTGTTAGAA 


ATCGATTTGA 


CTGTCCTGAT 


CAATTTGTCC 


TGTTCTTATT 


660 


TCATTTTACT 


ATAGAATCCA 


TCTGAAAGCC 


GTTTATGATT 


TCTATTGAAA 


TGAAGACCGT 


720 


CCATTTTAGT 


AGACTAAAGG 


ATTACTCAAA 


ACTTCGAGAA 


GGAAGACAGA 


TGAACTTGTT 


780 


TCTTATAAAA 


ATGTTGTCTG 


AAACTATTCG 


CTTTTTTCCA 


GAAATTTTAG 


AGGAAGAAAA 


840 


CTTTATGAGG 


AAAAAGGAGT 


TACTCAAATA 


ACAATTCAGG 


ATTAAAAATA 


GACAGTTGAG 


900 


GAGCGGAAGG 


TATAAATTAA 


GTTTGCTACT 


GTATAATGGA 


TTTAAATCAC 


TCAGCAGACA 


960 


GAACGAATAC 


TCTTCGAAAA 


TCTCTTCAAA 


CCACGTCAGC 


TTCGCCTTGC 


CGTAGATCCA 


1020 
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TAGGGGACTG GACTTCATCA GTTCTATCAA CAACCTCAAA ACAGTGTTTC GAGCTGACTT 1080 

CGATCAATTT TATCTGCACC TCAAAGCTGT ACTTTGAGCA AGCCTGACGA CTAGCTTCCT 1140 

ATTTGATTTT CATTGAATAT CAGAAACTCA TTCTCCATCA AATAATTCGA CTGCGTCTAA 1200 

TAATTTTTGG TCTGGCACGG TGTCAGAAAT AAAGGTTGTG TATTTGGAGA GGGGATTAAT 1260 

TTTAAAAAAT CCAGTCTTGT AAAATTTAGA ACTATCAATC AGTAAGATGG TTTCATGGGC 1320 
TTTGTCAATA ATATTCTTTT TTGAAATAG 



(2) INFORMATION FOR SEQ ID NO: 211: 
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(i) SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 1359 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:211: 



CTTTTTTATA AGCAATTCAA TTTTATACTC ATCTGCTTTC AAAAAGCATT CTAGTCCATC DU 
TCCGATTAAC GATGGACTTT ATCACCTCCT TCTCCAATCC TTGTATGACA TCTTGAAGTT 120 
GATTCATGAC ATCTTCCAAA ATTCGAAAGG CTTTATTCTT AAATCCACGT TTACGAATCT 180 
CTTTCCACAC TTGTTCAATG GGGTTCATCT CTGGTGTGTA TGGAGGAATA AATGCAAAAC 
CAATATTAGT CGGAATCTTT AAGACACTTG ATTTATGCCA TATAGCATTG TCCATAACGA 
GTAAAAAATA ATCATCTGGA TAAGCTTGTG AAAGCTCTTC TAAAAAGGCG TTCATCCACT 
CAGTATTACA TCTACCAGCT ATTAAGAAAA AATGATTCCG CCTGTTATGG GCATCAACAG 
CTCCATAACA ATAGTGAAAT TCTCGTATAT AGTGACTATG GACATGTGGA CCTACTCCTA 
TTGGAGACCA ACAAGATCCC AGTTTACTGA TTCTACCGAA ACCAGCCTCA TCTTGGTACA 
TCAAGCGAAC CTTATGAAAA CGTCTACTGG TTTAAATCGC TTTCTTGTCT TCTTGAATTG 
AGATTTTATT TTTAGACGCG ACAATGGTTT GAGCGTCTGC TTTCTTAGGA TGTTCTGGAC 660 
GTGGCATAAT ATTTCGCCAG CCATGGCGCT TCAACAGTTG ATAGAAGGCA TCACGTGTGT 
AGGAACGACC TAACTCCTTT TTATAAGCCT GAAATAAGGC ATCAATTGTA ACAAATTCTC 
CTGCCTCTGT AGCCTTCAAA TGGCGGGCAA GAAATACTTT CTCTTCCTCA ACTGTCATAT 
ATGCATGGTT ACACCACCAC TTTCCTGATA GAGTTGTTCA CATCTTATTT CAAACTACTA 
TAAAAGTTCT ATAATCTCTT TATAAGATTT GCCCATCAGA CGAAATATAA TAAATTGAAA 
CTAGAATAGT ACACCTCTAC TTCTAAAACA TTGTTAGAAA TCGATTTGAC TGTCCTGATC 
GATTTGCCCT ATTCTTGTTT CGTTTTACTA TAGAACGATT TGAAGGCGTT TATGATATTG 
AGCTGTACGA GAGTCTTTTA AAAGTGTTTT GATGGCTTGG ATTTCTTCTT TAGTTGATTT 
CATATCACTA TTATATAATG CTTTTTGATC TTTAGTTTAG TATTATCGTA GAAAATGGAC 
TAGOTGCTAT GGATACACCG GATGAGAATG GTTATOTAGC AGATGACTAT CGGATTACTT 
ACTTAGAGGC CCACATCAAG GCCATGCGAG ATGCCATTTA CCNAGACGGG GTTGACTTGC 1320 
TTGGTTATAC GACTTGGGGC TGTATCGATC CAGTTTCAG 1359 

(2) INFORMATION FOR SEQ ID NO: 2 12: 
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240 
300 
360 
420 
480 
540 
600 



720 
780 
840 
900 
960 
1020 
1080 
1140 
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1260 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 



CTAGTATTTC 


CGTTGTGNTT 


AGTTTGTCTG 


CCTATGGAGT 


TATTGCCTAT 


GTGCAAGGTC 


60 


AGNTGGATAT 


TCTTCTAGTG 


ATTCTGGCCA 


TGATTGGTGG 


TTTGCTCAGT 


TTCTTCATCT 


120 


TTAACCATAA 


GCCTGCTAAG 


ATCTTTATGG 


GTGATGTGGG 


AAGTTTGGCT 


TTAGGTGGAA 


180 


TGCTGGCAGC 


TATCTCTATG 


GCTCTCCACC 


AAGAATGGAC 


TCTCTTGATT 


ATCGGAATTG 


240 


TGTATGTTTT 


TGAAACAACT 


TCTGTTATGA 


TGCAAGTCAG 


TTATTTCAAA 


CTGACAGGTG 


300 


GTAAACGTAT 


TTTCCGTATG 


ACGCCTGTAC 


ATCACCATTT 


TGAGCTTGGG 


GGATTGTCTG 


360 


GTAAAGGAAA 


TCCTTGGAGC 


GAGTGGAAGG 


TTGACTTCTT 


CTTTTGGGGA 


GTTGGGCTTC 


420 


TAGCAAGTCT 


CCTGACCCTA 


GCAATTTTAT 


ATTTGATGTA 


AGAATGGCAC 


CCTGATGTTT 


480 


TAGGGTGTTT 


TTGTGTTTAA 


ATACACAATG 


AAAATCAAAG 


AACAAACTAG 


AAAGCTAACT 


540 


TTAGGCTGCT 


CAAAACATAG 


TATATTGAAA 


CTAAAATAGT 


ACACCTCTAC 


TTCTAAAACA 


600 


TTGTTAGAAA 


TCGATTTGAC 


TGTCCTGATT 


ACGATTTATC 


CTGTTCTTAT 


TTCATTTTAC 


660 


TATACCATCA 


ATAAGTGTTG 


AAAAGATCGT 


TGACAGAACT 


GACCAAGCCA 


GATCATCTTT 


720 


GTAACCACCT 


TGAGCTTGTA 


GCATATTTGT 


TAATTGAGCA 


AAGGCGCTAG 


AAGAGTTTGC 


780 


TGGGTCAGCA 


GTTGCGATTT 


TTCCTTTTAG 


TTCAGGTTTG 


AGAAGGTCGC 


TATATCCTTC 


840 


GATGTTCATG 


CCTTTAGTTA 


AATCAGGGTG 


GACGATTAAA 


ACACTACCAT 


CTAGTGTATA 


900 


AGGAGTAGAG 


AAGCCAGTTG 


TGTTTTGATA 


TTCTTTGATA 


ACATTATCAT 


TTTCTTTTGA 


960 


AATATAGTTT 


TCAAAGAGTT 


CTCCGTGGGT 


AGCATATTGT 


GTTATAAGAA 


CCACCAAAGA 


1020 


TAACATCAGC 


TACAGGAACT 


TCTTTTTCTG 


ACCTAGTTTT 


TTGAAAAGTT 


CTCCAGTACC 


1080 



AG 1082 
(2) INFORMATION FOR SFQ ID NO:213: 

(i) SEQUENCE CHARACTERISTICS. 

(A) LENGTH: 1303 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE * Genomic ON A 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 

CTGGACTTGA TAGGCATCTT TGTAATCCTC TAAAGCCTCT TTCATCAAGG CACTACCAAT 60 

178 
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TCCTTGACGC TGATAGATTG GTAAAACGAT TAAATCCTGA ACCAATACTG ATGAGAATCC 
ATCTCCAACC AAACGAATCA AGCCCACCAC AGCATCACCA TCAAGTCCCA CATAAATTAC 180 
TAATGAATGA GATAAGGCCT GCTCCAGCAT CTCTGGTTGA TGGGTATAAT TAGTCCAACC 240 
GACAGCCTGA TAGACATGTA AAACATCCTC TAGCTTGACA ATTTCTTGCT TTTTAATAGT 
AATCATCTCA ACACCTCTTA AAGTTCTCTC AAGCTCTTGT ACTGCTGTCC ATTTTTATCA 
AAATTTTCAG GACGCAACCA TGTTTCCAAA CGATCTTTGA CTTTGGGCCA GTCCTTATCA 
ATCATAGACA ACCAATCCAT ATCTCTCGTA CGCCCCTTAT AAACCACTGC CTGACGGAAG 
GTTCCTTCAT AAATAAAGCC CAAACGCTCC GCAGCACGTC TGGATGGCAG GTTAAGAGCA 
TCGCATTTCC ACTCATAACG ACGATAGTTA AGCTCTTCAA AGACATAGCG AGCCAGTAGA 
TACTGGGCTT CTGTCCCTAT CCGTGTCCCC CTGAGTTTTG GAGAAAAAGT GACAGCTCCC 
ACTTCTATTA CTCCGTTACT GGTCAATACG CATGAGAGAA AAATTCCCAA AGCCTTACCA 
GTTGCCTTGT CTATGTAGTA AAAACGGTCC TTACGAGCCA ACATCTGACT ACTATAGTAG 
ATTGAAACTA GAATAATATA CCTCTACTTC TAAAATATTG TTAGAAATCG ATTTGACTTC 
CCTGATCGAT TCGTCCTATT CTTATTTCAT TTTAATATAA TTGATAGTGG TCGCCCCAGC 
CAGATACCTT ATCTGCTATC CATTTAGGAA CCCCTAACTT AAGCAATCCC CATAATCGTC 
TCGATTTCTT CTTCCATTGC TTCCAGATAA TCACTCGTAG GCGAGTACGC AAGCGCTCAT 
CTATGTTAGT GACTATACTT TTCATATTTA TAATTCATTC CTTTCGTTTC ACTCAAGGCA 
CAACACAGAA TGAAAAAGTG TTGTGATCTT TATTTTGTTT TATAATAATA GTGAGAAAGC 
CTATCACTAC TACAAATCAC GGGGAGGTGA ATAAGTGAGT GGTACAGCCA CTACCTCGCA 
TATTTTGTCA CATCATTTAA CGGTACATAA TAAGTTGTAC CATCTGAATA AGTTGCTACA 
ATATCATTTG CATGCTCTCC TTCACCTTTA GCAAAGGTTG GAG 



(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1567 base pairs 

(B) TYPE: nucleic acid 
fC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

CTCTTCAATC AACTCACGCT TTAGGGCTTC CTGATGATTT TCACCTGCTT CAATTTCTCC 
ACATGGTAGG AACCAAGCAC CATTTGGTTC TTGAACAAGA ACAATTTGTT TTTGTTCAGG 
ATTAGGGATA ACTGCATATA CGCCATAGCG AGCAATATAG TCTGTATTCG CTTTTTTCTC 
CGAAAGTTGG GTTTGCCATT GCATTTTCCT CATTATCTAG TATCGTTATT ATTATAGTGA 
AATGAACCAA AAATAGTACA CAATGTGGTA TAATCTTTTT ATGGCATATT CAATAGATTT 
TCGTAAAAAA GTTCTCTCTT ATTGTGAGCG AACAGGTAGT ATAACAGAAG CATCACACGT 
TTTCCAAATC TCACGTAATA CCATTTATGG CTGGTTAAAG CTAAAAGAGA AAACAGGAGA 
GCTAAACCAC CAAGTAAAAG GAACAAAACC AAGAAAAGTT GATAGAGATA GACTTAAAAA 
CTATCTTACT GACAATCCAG ACGCTTATTT GACTGAAATA GCTTCTGAAT TTCGCTGTCA 
TCCAACTACC ATCCACTATG CACTCAAAGC TATGGGCTAC ACTCGAAAAA AAGAACCACA 

179 
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CCTACTATGA 


ACAAGACCCA 


GAAAAAGTAG 


CCTTATTTCT 


TAAGAATTTT 


ANTAGTTTAA 


660 


AGCACCTAGC 


ACAGGTCTAA 


CAAATGGTGA 


ATTAATCGCT 


CCAATGACTT 


ACGAAGAGAT 


720 


GATGACGAGC 


GACTTTTTTG 


AAGTATGGTT 


TCAGAAGTTT 


CTCTTACCAA 


CATTAACCAC 


780 


ACCATCGGTT 


A 11 A 1 AviToA 


x ik « mil x 

AATGAAATAA 


GAACAGGACN 


AATCGATCAG 


GACAGTAAAA 


840 


TCGAATTCTA 


AC AATGTTTT 


AGAAGTAGAG 


GTGTACTATT 


CTAGTTTCAA 


TCTACTATAT 


900 


TAATGGACAA 


TG C AAGATTC 


CATAGAATGG 


GTAAGCTAGA 


GTTCTTATGT 


GAAGAGTTTG 


960 


GGCATAAACT 


TTTACCTTTT 


CCTCCCTACT 


CATCTTAGTA 


TAGAAAAGTG 


AATCTAAAAT 


1020 


AGTACATAAC 


TGCTTCTAAA 


ACATTCTTAT 


AAATTGATTT AAATTCTCAA ATCATATTAT 


1080 


TCAGTTCTTA 


TTTCATTTTG 


CTCTACAATC 


CTGTTGAGAA 


GACACGTGTT 


CATATCAAAA 


1140 


AGGTATTGGC 


AAGTTGCAAT 


ACCTTTTTAC 


AAGGTTCTTT 


TGTCTTATTT 


TTGTTTCAAC 


1200 


TGACTATATC 


TCCTATGGTT 


CTAGTTCAGA 


AGGCTAGGCT 


ATAATTATGA 


TTGATAAGAA 


1260 


GTATCATTCC 


AAGTATTGAG 


AGTGAATGTT 


TCAAAATCAT 


GGGTTTCTAT 


AATGGTCAGG 


1320 


CTGGCATTTG 


CTAGACCGCC 


ATCTTTACGA 


AGAAGTGGTT 


CTTTATAGCC 


TAGGAGAGTA 


1380 


CGAAGACTGG 


CAGTAAGATT 


GGCGCCGTGT 


CCGACAATTA 


GAATACGCTC 


AGCTGGACTA 


1440 


TCTTTTAATG 


ATTTGATAAA 


TTGGATGGTC 


CGCTGAGTTG 


TACTATAGAG 


GGATTCGGCT 


1500 


CCGAACATTC 


GAGTGTCAAA 


TTGAGCAAGA 


TTTGAACGAA 


AAGCCTGGAT 


TTGTTGCGGG 


1560 


TAAATAG 












1567 



(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:215: 



CTGTTTCCAT 


AGCATGACTT 


CTGTACTAGA 


CTTTCTTTTC 


CGAATAAATA 


GATAGAACCA 


60 


CAGAATCTAG 


TAAACCTAGA 


ATTAAAATTA 


TGGTATAATA 


TTAGCAATAA 


AAGAAATCTG 


120 


GAGGATTAGA 


ATCATGGTAT 


CAACGAAAAC 


ACAAATTGCT 


GGTTTTGAGT 


TTGACAATTG 


180 


CTTGATGAAT 


GCAGCAGGTG 


TGGCTTGTAT 


GACGATAGAG 


GAGTTAGAAG 


AGGTCAAAAA 


240 


CTCAGCGGCA 


GGAACCTTTG 


TTACTAAGAC 


AGCGACCTTG 


GACTTCCGTC 


AGGGGAATCC 


300 


TGAGCCACGC 


TACCAAGATG 


TTCCACTTGG 


TTCCATCAAC 


TCTATGGGCT 


TGCCAAATAA 


360 


TGGCTTAGAC 


TATTATTTGG 


ATTATCTTTT 


AGATTTGCAG 


GAAAAAGAGT 


CGAACCGAAC 


420 


TTTCTTCTTA 


TCTCTGGTCG 


GCATGTCTCC 


AGAGAAACCC 


ATACTATTTT 


GAAAAAAGTC 


480 


CAGAGA 












486 



(2) INFORMATION FOR SEQ ID NO: 216: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 814 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 6: 

CTCTATCAGG AAAGTTAAAT TAATTTATAG AAATATTTTA GTATTTAAGC CCTACTGTTA 60 

TAGATTCAAT ACACTATATA TGTGTTTGTC TGATAAAAAT TTCTACTCTT TTTGATTTTA 120 

AATAAGTATT AGTTTACATT ATGGTGTGAA TTGGGTTTGA TATCTCTTTT GAGGAAGTTG 180 

CCTTAGATTT TTCTGATTGT GTTTTATTGT ACAGTGTATC TTGCTTGTTT TGAACAGAAT 240 

TTTTTATGAC ATTTGTCATA TTTTCTAGTG ACAGAAGCTT CTGCCTCCTC TGATTTTAAA 300 

AGACTATAAT TGTAGTATGA AATGGGGGAA GAAGAGATGA GAAATAAAAT GATTATCGCA 360 

GTAAGTTTAG TAGTAGCAGG AGTTATGACC TATCTCATGT TTTCGGGATT GGATGAGAAT 420 

TTCTACCATT TTCCTTGGGA GGTCTTTGCT GGCTTTGGAA TCATCTTCTT GGCTTGTCCA 480 

GAGAAGGTTT GAAATTAGTA AGAGATGTGA AAAAGGAGTT TGAAAAATGA AAAAAGCAAC 540 

TATCTATTTC TTTATCGGCC TGTCACTCTT GGTATGGTTG GTAGAAATGT TTACAGGTTG 600 

GTTTGCTCAA ACCTTCCTTC ACCAGTTCAT CCGTGGTGCA TGGGGATTAG GATTTATGAT 660 

TTTTATCGCC TTTCCGATGG GAAAGGAGTT GCTGGAAGGA GAATATCATG AACATGATTA 720 

AGGTTCAAGG ACTACATAAA AATATTAAGG GCAAGGCTAT TTCAAAGGAT ATTTCCTTTG 780 

AAGTAGCAGA AGGTGAATGC GTTGCCTTGA TTGG 814 

(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1319 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 



60 



CTTGGTACGA CCGTATGGGT GGTCACTGAA AGTGGGAAAC TTCCAAGATG GGCACTGTGT 

ACGGATCCCC GTAAACTGTC GCANAAGAAC TGAAGATGAT GTTTTTACAG TTGTTTTCTT 120 

CCATGGCTTT CAAAAGGCTG ACAGTTCCAG CGATATTGTT GTCATAGTAG GCAAGAGGGA 180 

TACGTGTTGA TTCGCCAACA GCCTTCAAAC CAGCAAAGTG AATGACACCA GTCAGTTCTT 240 

CCTGCTTGAA AATATCTCTG AGGGTATCTG TGTCACNAAT ATCTGCCTCA TAGAAAGGAA 300 

TCTCAACTCC TGTGATTCTT TCAACAACTT CTAAACTCTT ACNATTGCTA TTGACAAGAT 3 60 

TATCCACCAC AACAACTTGA TGGCCTGCTT GGATCAATTC AATAACAGTG TGGGTTCCAA 420 

TAftAACCGGC ACCACCAGTT ACCAAAATCT TTTCTTGCAT CTTTTTTCCT CGATTCTCAG 480 

ATTATTTTTT CTTATTTTAA CCATTTTTGA CAGGGAATGT CATTTGCCAC CTTAAACTAC 540 
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CTGATAAAAT 


TTCAGTAAAA 


TGCTTATACT 


CTTGGAAAAT 


CCAATTCAAA 


CCACGTCAAC 


600 


GTCGCCTTGC 


CATGGGTATG 


GTTACTGACT 


TCGTCAGTTC 


TATCCACAAC 


CTCAAAGCAG 


660 


CGCTTTGAGT 


AACCCGCGGC 


TAGTTTCNTA 


GTTTGTTCTT 


TGATTTTCAT 


TGAGTATTAT 


720 


TCGCTTTTTA 


CTCGTTTGAC 


ATAGTTTTCA 


ATTGGGTAGT 


TTTCAGGGTC 


CAAGGTCAAC 


780 


TCCTTGTCTT 


GGATCAGTTG 


AGCTAAATGG 


TAACCAATAA 


TAGGACCAGT 


TGTGAGGCCT 


840 


GATGAACCTA 


GTCCACTGGC 


TGCATAGACA 


CCAGTTAAGT 


CAGGCACCTG 


CCCAAAGAAA 


900 


GGAGAGAAAT 


CACTGGTGTA 


GGCACGGATT 


CCAACACGCT 


CAGATTTTGA 


AGTAGCTTCA 


960 


GCCAAAATCA 


GATAGTGAGT 


CAAGGTGGCC 


TCCTCCATTT 


GTTGGAGCAA 


GGTTTCATCT 


1020 


ACCGTCAAAT 


CAAATCCCAT 


GTCATTTTCG 


TGGGTAGCGC 


CTAAGGACAA 


TTTCCCACCT 


1080 


GCAAAAGGAA 


TCAAATCCCA 


CTCCCCTTCT 


GGCATGACAA 


CAGGGTAAGC 


TTCCATGTCT 


1140 


TGGACCAGCT 


GATAATCTCG 


TAGTTGTCCT 


TTTTGAGGAC 


GAACATCTAC 


TTCATAACCC 


1200 


AAGGGTTCTA 


ACATGTCCCC 


CAACCAAGCT 


CCNGTCGCCA 


AAATAACCTG 


CTCAAACTCC 


1260 


TCTTCACCAA 


TCTGGTAGCC 


TGATGCTAAC 


GGTGTCAGAG 


TCACTTTTTC 


TTTGACCAG 


1319 



(2) INFORMATION FOR SEQ ID NO: 218: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 



CTCAAAATAC 


TGTTTTGAGG 


TTGCAGATGG 


AAGCTGACGC 


GGTTTAAAGA 


GATTTTCGAA 


60 


GAGTATAAAC 


TGCTTATAAA 


ATAAAAAGGA 


GCCCTGATGG 


AACACATTAT 


TTATCAGTTT 


120 


GAAGAGGACT 


TGGCAATCCT 


TACCTTGAAC 


CGTCCTGAGG 


TCGCAAATGG 


TTTTCATATT 


180 


CCCATGTGTG 


AGGAGATTTT 


AGAAGCTCTG 


ACTTTGGCAG 


AAGAAAATCC 


AGCTGTGCAT 


240 


TTTATCTTAA 


TCAATGCCAA 


TGGAAAGGTC 


TTCTCAGTTG 


GGGGAGATTT 


GGTAGAGATG 


300 


AAGCGGGCAG 


TGGATGAGGA 


TGATATTCCA 


TCATTGACAA 


AAATCGCAGA 


GTTGGTCAAT 


360 


ACGATTTCTT 


ATAAAATCAA 


GCAAATAGCC 


AAACCTGTTT 


TTAATGGAAG 


TTGACGGTGC 


420 


TGTTGCAGGT 


GCCGCAGCGA 


ATATGGCTGT 


TGCGGCAGAT 


TTCTGTTTGG 


CGACGGATAA 


480 


GGCTAAGTTT 


ATCCCAAGCC 


CTTTGTTGGT 


GTGGTTGGCT 


CCAGATTCCA 


GGGG 


534 



(2) INFORMATION FOR SEQ ID NO:219: 

<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



1S2 



WO 97/37026 PCT/US97/05306 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 



CTGCAGCCGG AAGTTATCTG GTTACAAAAA ATCGCCAAAC CATCACAGAT GAAAGTCTTG 60 

AATACCACTG ACCGCGTTCA AGCTATCAAG GACGATGTGG ATATTATCCA AAACAGCCTG 120 

CAAATCATTA ACCAGCAAAA AGAACTTATC AAGGAATACC AAGAAGACTT GACTTACAAG 180 

TTTAAGGTCT TGGAAAAGGA TATCCAAACT AGGACTAGCT GTGATAAAAG AAATGCAGGG 240 

AACTGAAGAT AAGTAAAAAG AGCCCGACGG CTCTTTTTTA AAACGTAAAC AAGTTATAGT 300 

GAATTGAATT TAGAATAGCA CATCATAGTT TCTAAAGCAT TGTTAGAGAT TACTTTAAAA 360 

TCCTCTTATC AATTTGTTCA TATTCTATTT CAATCTACTA TATATATCCC ATCAACTATG 420 

CATCATAATT TAGGTAACTC ATACTCAATA AAAATCAAAA AGCAAACTAG AAAGCTAGGC 480 

ACAGACTGCT CAAAACACCG TTTTAAGGTT GTGGATAGAA CTGACGAAGT CAGTAACCAT 540 

ACCCATGACA AGCCGACGCT GACGTGGTTT GAAGAAATTT TTGAAGAGTA TCATTCACCA 600 

TTTCACAACT AAATCGAAGA CTTTCTGCCG TAATGAAAAC ACCTGAAACA GCTTGGTTTC 660 

AGCTGTCCGG AAACTTTAAG ACTTAGGTTC AAAGTTTAGG TATGGAACTT CGAAGGAGGT 720 

CGCTACCGTC CGTCATTACT TAGNGAAGNC TTAAAAAATC TATAAAGTAA AAAGAGCCCG 780 

ACAG 784 



(2) INFORMATION FOR SEQ ID NO: 220: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:220: 



CTACTCGTTT TAGTATAGTA AACTGAATCT AGACTAGCAC AATGCNGCTT CTANAGCGTT 60 

TCGAAACTAG AATAG3ACAA CATAACTGCT AAAAGATTTC TATAAATTCA TTTGATTTTC 120 

CTAATCAATT TGTTCGTATC CTANTTCACT C C ACT ATAA A ATATTCTTAT CAATTGATTT 130 

GNATGCCAAA ATTCCATCGT TCAGGTCTTA T TTC ATTNGG CTNTAAAAAA CTCTCCTGGG 240 

NAATCCCAGG AGAGNATGAT TACNTATTTG AACTTGAATC CTTCGTANAT AAGCTCTGTT 300 

TTTGGATTTT GTTTCTTAAT CTGTTTGGCA AGTGCCTTCA TCATAGAAAT AGGACCACAC 3 60 

ATATAGACGG TTGCATGTTC GGGCACTTTT TTTGTTCAAA ATTAAGATAG CCGTCTTTCG 420 

TACTGTCGAT TAGATGGAGT TCAAAATTAG GATTTTTCTG AGCATAGTTA CGGAGTAAAT 480 

CTAGGTAGAC TGCATTTTCA TCTCCACGGA AGCTATAGTA GAAGTGAACC TGTTTATCTA 54 0 

AAATAGGATG TTCACGGATG TAAGAGATGA AGGGGGTGAT CCCAATACCT CCAGCAATCC 600 

AAACCTGATT TTCTCGTCCT TCTTCTATGA TCATGTGTCC GTAAGCTCTG TCTAGGGTTA 660 

CTTTGCTGCC GGCTTGAAGA TTATCATAGA T/iTTTCTTGG TATGGTCGCC TGAATTTTTA 720 

ACAGTAAAGT TAAAGAGTTT GAACCCATGA ACTCCCTGAG AATAGAAAAG GGATGCCGGA 780 

1*3 
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ACCACTTTCA AAGCCTTCTT GGAAAATCTT TAGAAAGGCA AATTGTCCTG ATTGATAGTT 840 
GAAAGGTCTG CTAAGATGGA TTTGAATTTC TCTAGTATCG TGATTTAAGC GTTTGAGATG 900 
GGTAATTTTC CCTAGATAGG GGAAGGAAAT CTTTTGATAT AGAAAAATGA TATAAAAACC 960 
AG 962 

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 



CTCTTTCCAC 


TATTATAACA 


CGAAAGAAAG 


AATTGTCAGG 


GAACTGTACA 


GATTTCTTTT 


60 


CTATCTATTT 


ATAACTCAAT 


GAAAATCAAA 


GAGCAAACTA 


GGAAACTAGC 


CGCAGGCTGT 


120 


AATTGAGTAC 


GGCAAGGCGA 


CGTTGACGCG 


ATTTGAATTT 


GATTTTCGCA 


GAGTATTATT 


180 


CGTAAAAAAT 


CTCAAAAAGC 


CTACCTTTCG 


GTAGACTTAG 


TTTGTTTCTA 


TTCTAATCGG 


240 


CACTCTTCCA 


AAATTTTGCT 


CTGCTATACT 


TGGCTTTCCT 


AGTTGGTAAA 


TCTGGTCAAC 


300 


CTTTTGAGTC 


ATAGCATCCC 


AAGGTTCTTT 


GCCAATTCGG 


CTGACTAGAT 


TGACCTGTCC 


360 


TTTCAGAGAC 


TTGAGATGTT 


GCCTGCCTTT 


TTCAGTAAAT 


CCAAGGACAT 


GAATGGCTTC 


420 


TGGCAAGTCA 


CTTTCTCTAG 


CCTGCATCAA 


AATATAGGTC 


AATAGGCGTC 


TGACACGCGC 


480 


CTTGGTGTAA 


CGTTTGGTAG 


TCACTAACTC 


GACCAATTCT 


TCCACAGACT 


GGGCTGTTTT 


540 


AATAGCTTCT 


TTAATGCGCA 


CAGCCATTTC 


TTGATTGACC 


TGATAGATGG 


TGGTTAGGTC 


600 


GGGATTTGAC 


AAGATTTGAT 


AGCGGAGCAA 


GGGAAAATAG 


TCTTCCCAGA 


TCACCTTACT 


660 


GGCTTGCTCA 


AATAGGGCAA 


CAGAAGGCAT 


AAAACGTTCT 


AAGAAATCTT 


GGTCCTTCTG 


720 


ATGTTGACGG 


AGGGCTGTCG 


CCGAGGCAAA 


GTCCACATCT 


TTATTCACAG 


AATGGTAACC 


780 


TGCCCCCTGA 


CGCTGAATCG 


GATGCAGCTT 


GATATTCCGT 


CCTGCAACCG 


CCTTGGCATA 


840 


GGCCAAAACA 


AAGAACATGA 


TTGGGGTGTA 


TTACCTGAAA 


AATCAAGACC 


AGCAAATTCC 


900 


TTCCACATAG 












910 



(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 866 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 









i i- i uvjuuv* i i 


GvJwVTtTCTCA 


GTCGCTGGAG 


60 




luuuU i. lu X i 


/"•ATPATWTAn 


t Art a r* a a a r*T 


CTTCATACTC 


CAACACTTGC 


120 


CCATCAAAAU 


CuAAU L. re a r 


CTATTTTTTC 


ATQ-AATCTGC 


AATTTAGCTG 


ATTTTTCTTT 


180 


TTTACCATTT 


ACAGTCACGC 


GCCCAGCCTT 


GAGCAAGTTT 


TTGACCTCAG 


TCCGACTTCC 


240 


CACCGCACAG 


GCAACTAAAA 


ATTTATCTAA 


TCTCATAGAA 


CTATTATATC 


ATATCAAAAG 


300 


GAGGCTGGTA 


CAATGACCAA 


CCTCCTTTTC 


GTTTCATACT 


CTTCAAAAAT 


CTCTTCAAAC 


360 


CGCGTCAACG 


TCGCCTTGCC 


GTATATATGT 


TACTGACTTC 


GTCAGTTCTA 


TCTGCAACCT 


420 


CAAAGCAGTG 


CTTTGAGCAA 


CCTGCGGCTA 


GTTTCCTAGT 


TTGCTCTTTG 


ATTTTCATTG 


480 


AGTATCAGAT 


TTAGGAAATT 


AACTTCCTCG 


TCTCCAAAAA 


ATAGCTAAGA 


CAATCATGGC 


540 


ACCTAAAACA 


GCTGGGATAA 


TAGCTGTTCC 


TGATAAAACT 


GGCCCCCAAG 


TTCCAAAGAG 


600 


CAAGTGACCT 


AGAAAGGCTC 


CGATCCAACC 


GAGAAACATT 


TTTCCAAAAC 


ATCCCATTCG 


660 


CTCTCCACGA 


TTGGTCATAG 


CACCTGCTAA 


AAATCCCACT 


AGGAGACCAA 


CGAACATACT 


720 


TCCTAACATA 


TTATCTCCTT 


AATTTGCCCA 


ATTCCCATTA 


CGGAAAAGAA 


GTACTCGCGT 


780 


TCCATCCTCA 


CGAATACCAT 


CGATATCCAT 


TTGGTTAGAA 


CCAATCATAA 


AGTATACGTG 


840 


AACATCTGAA 


CGGTTAAGCC 


CTGCAG 








866 



(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:223: 



CTTCAGGATG 


TGGGAAAACG 


ACCCTTCTTC 


GTATGATTGC 


AGGTTTCAAC 


AGTATCAAAG 


60 


ATGGAGAATT 


TTACTTCGAT 


GATACAAAAA 


TCAATAATAT 


GGAACCCAGC 


AAACGCAATA 


120 


TCGGGCTGGT 


TTTCCAAAAC 


TACGCTATTT 


TCCCACATTT 


GACTGTCCGA 


GACAACGTTG 


180 


CTTTTGGTCT 


TATGCAAAAG 


AAGGTTCCAA 


AAGAAGAATT 


GATTCAACAG 


ACCAACAAGT 


240 


ATCTTGGACT 


CATGCAAATT 


GCTCAATATG 


CGGATCGAAA 


GCCCGATAAA 


CTCAGTGGTG 


300 


GACAACAACA 


ACGTGTCACC 


TTGGCATGCG 


TCTTAGCGGT 


TAATCCAAGT 


GTTCTCCTCC 


360 


ATGGACGAAG 


CCACTTAGTA 


ATCTGGAGGC 


CAAACTTCGC 


TTGGATATGC 


GTTCAAGCCC 


420 


ATCCCGAGAA 


ATCCAACCAC 


GAAGTTGGGG 


AATTACAACT 


GTTTATGTAA 


CCCACGACCA 


480 


AGAAGGAGCC 


ATGGCTATTT 


CAGACCAAAT 


TGCCTGTTAT 


GAAAGATGGG 


GTGATCCAAC 


540 


AAATCGGCCG 


ACCAAAAGAA 


CTGTATCATA 


AACCAGCTAA 


TGAGTTAGTG 


GCAACCTTTA 


600 


TCGGACGCAC 


AAATATTATC 


CCTGCCAATC 


TTGAAAAACG 


GAGCGACGGC 


GNTTATATCG 


660 


TCTNTTCAGA 


TGGANANGCC 


CTTCGAATGA 


TAG 






693 



(2) INFORMATION FOR SEQ ID NO: 224: 
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(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 183 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224 : 

Met Ser Met Phe Thr Met Val Lys Met Asn Pro Leu Arg Gly Leu lie 

15 10 15 

Cys Asn Leu Lys Gly Asn Lys lie Thr Ala Leu lie Gly Pro Ser Gly 

20 25 30 

Ser Gly Lys Ser Thr Tyr Leu Arg Ser Leu Asn Arg Met Asn Asp Thr 

35 40 45 

lie Asp lie Ala Lys Val Thr Gly Gin lie Leu Tyr Arg Gly lie Asp 

50 55 60 

Val Asn Arg Pro Glu lie Asn Val Tyr Glu Met Arg Lys His lie Gly 
65 70 75 80 

Met Val Phe Gin Arg Pro Asn Pro Phe Ala Lys Ser lie Tyr Arg Asn 

85 90 95 

lie Thr Phe Ala His Glu Arg Ala Gly Val Lys Asp Lys Gin Val Leu 

100 105 110 

Asp Glu lie Val Glu Thr Ser Leu Ser Gin Ala Ala Leu Trp Asp Gin 

115 120 125 

Val Lys Asp Asp Leu His Lys Ser Ala Leu Thr Leu Ser Gly Gly Gin 

130 135 140 

Gin Gin Arg Leu Cys lie Ala Arg Ala lie Ser Val Lys Pro Asp lie 
145 150 155 160 

Leu Leu Met Asp Glu Pro Ser Leu Ser Leu Gly Ser Asp Cys Asp His 

165 170 175 

Ala Thr Arg Arg Asp His Val 
180 

(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE; protein 



PCT/US97/05306 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 



Met lie Lys lie Ser Asn Leu Ser Lys Ser Phe Ser Gly Gin Thr Val 

15 10 is 

Leu Asp His Leu Asn Leu Asp lie Gin Lys Gly Glu Val Val Ala Leu 

20 25 30 

lie Gly Ser Ser Gly Ala Gly Lys Ser Thr Phe Leu Arg Ser Leu Asn 

35 40 45 

Tyr Leu Glu Thr Pro Asp Ser Gly Ser lie Gin He Asp Gly Phe Ser 

50 55 60 

Val Asp Phe Ser Lys He Thr Gin Glu Glu He Leu Ala Leu Arg Arg 
65 70 75 80 

Lys Leu Ser Met Val Phe Gin Gin Phe Asn Leu Phe Glu Arg Arg Thr 

85 90 95 

Ala Leu Asp Asn Val Lys Glu Gly Leu Val Val Val Lys Lys Leu Ser 

100 105 no 

Asp Gin Glu Ala Thr Lys lie Ala Lys Glu Glu Leu Ala Lys Val Gly 

115 120 125 

Leu Ser Asp Arg Glu Asn His Tyr Pro Arg His Leu Ser Gly Gly Gin 

130 135 140 

Lys Gin Arg Val Ala Leu Ala Arg Ala Leu Ala Met Lys Pro Asp Val 
145 150 155 160 

Leu Leu Leu Asp Glu Pro Thr Ser Ala Leu Asp Pro Glu Leu Val Gly 

165 170 175 

Glu Val Glu Lys Ser He Ala Asp Ala Ala Lys Ser Gly Gin Thr Met 

180 185 190 

He Leu Val Ser His Asp Met Pro Phe Val Ala Gin Val Ala Asp Lys 

195 200 205 

He Leu Phe Leu Asp Lys Gly Lys He He Glu Ser Gly Thr Pro Asp 

210 215 220 

Glu He He His Thr Pro Lys Glu Glu Arg Thr Lys Glu Phe Phe Thr 
225 230 235 

Ser Tyr Lys Arg Thr Tyr He 
245 



240 



(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 anino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLCGY: linear 
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PCT/US97/05306 



(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 

Met Phe Ser Leu Arg Ser Val Phe Asp Gly He Pro Arg He Val Gin 

15 10 15 

Gin Leu Pro Thr Thr He Met Leu Thr He Gly Gly Ala Leu Phe Gly 

20 25 30 

Leu Val Leu Ala Leu Leu Phe Ala He Val Lys He Asn Arg Val Lys 

35 40 45 

He Leu Tyr Pro Leu Gin Ala Phe Phe Val Ser Phe Leu Lys Gly Thr 

50 55 60 

Pro He Leu Val Gin Leu Met Leu Thr Tyr Tyr Gly He Pro Leu Ala 
65 70 75 80 

Leu Lys Ala Leu Asn Gin Gin Trp Gly Thr Gly Leu Asn He Asn Ala 

85 90 95 

He Pro Ala Ala Ala Phe Ala He Val Ala Phe Ala Phe Asn Glu Ala 

100 105 110 

Ala Tyr Ala Ser Glu Thr He Arg Ala Ala He Leu Ser Val Asn Pro 

115 120 125 

Gly Glu He Glu Ala Ala Arg Ser Leu Gly Met Thr Arg Ala Gin Val 

130 135 140 

Tyr Arg Arg Val He He Pro Asn Ala Ala Val Val Ala Thr Pro Thr 
145 150 155 160 

Leu He Asn Ser Leu He Gly Leu Thr Lys Gly Thr Ser Leu Ala Phe 

165 170 175 

Ser Ala Gly Val Val Glu Val Phe Ala Gin Ala Gin He Leu Gly Gly 

180 185 190 

Ala Asp Tyr Arg Tyr Phe Glu Arg Phe He Ser Val Ala Leu Val Tyr 

195 200 205 

Trp Val Val Asn He Gly He Glu Ser Leu Gly Arg Phe He Glu Arg 

210 215 220 

Lys Met Ala He Ser Ala Pro Asp Thr Val Gin Thr Asp Val Lys Gly 
225 230 235 240 

Asp Leu Arg 



(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 63 amino acids 
(BI TYPE: amino acid 
{C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 

Met Tyr Arg lie Asp Asp Asp Glu Gin Phe Val Leu Asp Phe Leu Lys 

Gin Glu Lys Val Leu Leu Val His Gly Arg Gly Phe Asn Trp Gin Glu 

20 25 3 0 

Pro Asp His Phe Arg He Val Tyr Leu Pro Arg Val Asp Glu Leu Ala 

35 40 45 

Gin He Gin Glu Lys Met Thr Arg Phe Leu Lys Gin Tyr Arg Arg 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 

Met Asn Asp Glu Ala Ser Lys Gin Leu Thr Asp Ala Arg Phe Lys Arg 

1 5 10 15 

Leu Val Gly Val Gin Arg Thr Thr Phe Glu Glu He Leu Ala Val Leu 

20 25 30 

Lys Thr Ala Tyr Gin Leu Lys His Ala Lys Gly Gly Arg Lys Pro Lys 

35 40 45 

Leu Ser Leu Glu Asp Leu Leu Met Ala Thr Leu Gin Tyr Val Arg Glu 

50 55 " 60 

Tyr Arg Thr Tyr Glu Gin lie Ala Ala Asp Phe Gly He His Glu Ser 
65 70 75 80 

Asn Leu He Arg Arg Ser Gin Trp val Glu Val Thr Leu Val Gin Ser 

85 90 95 

Gly Val Thr He Ser Arg Thr Pro Leu Ser Ser Glu Asp Thr Val Met 

100 105 no 

He Asp Ala Thr Glu Val Gin He Asn Arg Pro Lys Lys Arg He Ser 
115 120 125 
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Glu Ser Phe Trp 
130 
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(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 154 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 

Met Arg Arg Lys Tyr Lys Ser lie Ala Leu Lys Lys Glu Leu Ala Asn 

15 10 15 

His Ser Gly Lys Lys Lys Phe His Ala Met Lys Ala Gin Ala He Val 

20 25 30 

Thr Ser Gin Gly Arg He Val Ser Leu Asp He Ala Val Asn Tyr Ser 

35 40 45 

His Asp Met Lys Leu Phe Lys Met Ser Cys Arg Asn He Gly Gin Ala 

50 55 60 

Gly Lys He Leu Ala Asp Ser Gly Tyr Gin Gly Pro Met Lys He Tyr 
65 70 75 80 

Pro Gin Ala Gin Thr Pro Arg Lys Ser Ser Lys Leu Lys Pro Leu He 

85 90 95 

Ala Glu Asp Lys Ala Tyr Asn His Ala Leu Ser Lys Glu Arg Ser Lys 

100 105 110 

Val Glu Asn He Phe Ala Lys Val Lys Thr Phe Lys Met Phe Ser Thr 

115 120 125 

Thr Tyr Arg Asn His Arg Lys Arg Phe Gly Leu Arg Met Asn Leu He 

130 135 140 

Ala Gly He He Asn Tyr Glu Leu Gly Phe 
145 150 

(2) INFORMATION FOR SEQ ID NO:230: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 49 amino acids 
{B) TYPE: amino acid 
(C) STRANDEDNESS: single 
{DJ TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



PCT/US97/05306 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:230: 



Met Val Ser Ser Ser Gly Ser Glu 

1 5 
Gin Leu lie Ala Glu Lys Val Ser 
20 

Lys Asp Val Arg Lys lie Pro Thr 
35 40 

Arg 



Phe Gin Ser Gly Trp Gin Glu His 

10 15 

Lys Thr Leu Asp Lys Thr Phe Asp 

25 30 

Ser Pro Val Leu Ser Lys lie Cys 
45 



(2) INFORMATION FOR SEQ ID NO: 231: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 



Met lie Lys lie Leu Ala Ala Cys 

1 5 
Gin lie Lys Ser Ala Leu Glu Glu 
20 

Val His Cys Asp Ala Val Met Val 

35 40 
Lys Gly Tyr Asp lie Phe Thr Pro 

50 55 
Glu Pro Gly lie Pro Val He Glu 
65 70 
Pro Ala Met Ser Ala Pro Val Phe 
85 

Gin Asn Met Val 
100 



Gly Ala Gly Val Asn Ser Ser His 

10 15 
Glu Leu Ser Asn Arg Gly Tyr Asp 
25 30 
Lys Asp Val Asn Glu Asp Leu Met 
45 

He Ala Ala Thr Asp Leu Gly Phe 
60 

Ala Gly Pro He Leu Phe Arg He 

75 80 
Asp Asn He Arg Leu Pro Ala Lys 
90 95 



(2) INFORMATION FOR SEQ ID NO: 232: 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 49 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 

Met Asn Thr Lys Met Met Ser Gin Phe Ser Val Met Asp Asn Glu Met 

15 10 is 

Leu Ala Cys Val Glu Gly Gly Asp lie Asp Trp Gly Arg Glu He Ser 

20 25 30 

Cys Ala Ala Gly Val Ala Tyr Gly Ala He Asp Gly Cys Ala Thr Thr 
35 40 45 

Val 



(2) INFORMATION FOR SEQ ID NO:233: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



|xi) SEQUENCE DESCRIPTION; SEQ ID NO: 233: 



Met Arg Glu Lys Glu He Phe Asp 

1 5 
Arg Gin Gly Glu Asp Phe Val Val 
20 

Asp Leu Asp Ala Asp Ser Val Asp 

35 40 
Glu Asp Glu Phe Ser He Glu He 

50 55 
Gin Ser Val Gly Asp Val Val Lys 
65 70 



Ser He Val Thr He He Gin Glu 

10 15 
Thr Glu Ser Leu Ser Leu Lys Asp 
25 30 
Leu Met Glu Phe He Leu Thr Leu 
45 

Ser Asp Glu Glu He Asp Gin Leu 
60 

Tie He Gin Gly Lys 
75 



(2) INFORMATION FOR SFQ ID NO: 234: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 40 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 4: 

Met Ala His Gly Asp Leu Leu Tyr His Asp Gly Leu Phe Phe Ser Ala 

15 10 15 

Lys Lys Glu Asp Gly Thr Tyr Asp Phe His Glu Asn Phe Glu Tyr Val 

20 25 30 

Thr Pro Trp Leu Lys Gin Val Asp 
35 40 

(2) INFORMATION FOR SEQ ID NO: 235 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

Met Ser Leu He Thr His Arg Arg Phe He Ser Cys Asn Glu Asn He 

15 10 15 

Lys His Tyr Lys Arg Leu lie Asp Lys Ala Glu Lys Cys Val Asn Asp 

20 25 30 

Leu Met Ala Glu Leu Asn Ser Val He Thr Thr Val Thr Gly He Glu 

35 40 45 

Asn Arg Leu Gly Ala Val He Leu Ala Glu He Arg Asn He His Ala 

50 55 60 

Phe Asp Asn Pro Ala Gin Leu Gin Ala Phe Ala Gly Leu Asp Ser Ser 
65 70 75 80 

He Tyr Gin Ser Gly Gin lie Asp Leu Val Gly Arg Met Val Lys Arg 

85 90 95 

Gly Ser Leu His Leu Arg 

100 

(2) INFORMATION FCR SEQ ID NO:236: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

Met He Arg Ala Val Phe Phe Ser He Asp Asn Asp Met Glu Glu Ala 

15 10 15 

Ala Arg Ser Met Gly Ala Ser Ser Phe Tyr Thr Met Val Arg Val He 

20 25 30 

He Pro Tyr He Leu Pro Val Val Leu Ser Val Val Val Leu Asn Phe 

35 40 45 

Asn Ser Leu Leu Ser Asp Tyr Asp Leu Ser Val Phe Leu Tyr His Pro 

50 55 60 

Leu Phe Gin Pro Leu Gly He Val He Lys Gin Ser Thr Asp Glu Thr 
65 70 75 80 

Ala Thr Leu Asn Ala Gin Ala Met Met Phe Val Tyr Ser Val He Leu 

85 90 95 

Met He Met Ser Ser Ala Ala Leu Tyr Leu Ser Ser Leu Phe Gin Gly 

100 105 110 

Lys Arg Gly Lys Arg 
115 

(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

Met Glu Lys Cys Tyr Thr Asp Val Thr Glu Phe Ala He Pro Ala Ser 

15 10 15 

Thr Gin Lys Leu Tyr Leu Ser Pro Val Leu Asp Gly Phe Asn Ser Glu 

20 25 30 
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lie lie Ala Tyr Asn Leu Ser Thr Ser Pro Asn Leu Glu Gin Val Gin 

35 40 45 

Thr Met Leu Glu Gin Ala Phe Thr Glu Lys His Tyr Glu Asn Thr lie 

50 55 60 

Leu His Ser Asp Gin Gly Trp Gin Tyr Gin His Asp Ser Tyr His Arg 
65 70 75 so 

Phe Leu Arg Val Arg Glu Phe Lys His Leu Cys His Ala Arg Glu Thr 

85 90 95 

Ala Lys Thr Thr Val 
100 



(2) INFORMATION FOR SEQ ID NO: 238: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 



Met Glu He Leu Thr Thr Gin Gin 

1 5 
Leu Ala Ser Pro Leu Gly Leu Gin 
20 

Lys Ser Gly Ser Tyr Thr Val Pro 

35 40 
Asn Thr Glu Pro Leu Arg He Asp 

50 55 
Ala Val Glu Asn Leu His Val Val 
65 70 
Val Ser Asp His Tyr Gly Leu Asn 
B5 



Val Arg Glu Gly Tyr Gin Ala He 

10 15 
Asp Ala Phe Glu Val Ala Gin Glu 
25 30 
Pro Glu He Asp Gly Trp Lys Gly 
45 

Tyr Val Phe Thr Thr Lys Glu Leu 
60 

Phe Asp Gly Asn Lys Ser Pro Gin 

75 80 
Ala Met Leu Asn Trp Lys 
90 



(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 



Met Leu Lys His Leu Asn Leu Lys Gly His Leu Leu Thr Ala He Ser 

15 10 15 

Tyr Met He Pro He Val Cys Gly Ala Gly Phe Leu Val Ala He Gly 

20 25 30 

Leu Ala Met Gly Gly Gly Val Pro Asp Ala Leu Val Ala Gly Lys Phe 

35 40 45 

Thr He Tip Asp Ala Leu Ala Thr Met Gly Gly Lys Ala Leu Gly Leu 

50 55 60 

Leu Pro Val Val He Ala Thr Gly Leu Ser Tyr Ser He Ala Gly Lys 
65 70 75 80 

Pro Gly He Ala Pro Gly Phe Val Val Gly Leu He Ala Asn Ser Val 

85 90 95 

Gly Ser Gly Phe He Gly Gly He Leu Gly Gly Tyr He Val Gly Phe 

100 105 HO 

Leu Val Gin Ala He He Lys Lys Val Lys Val Pro Asn Trp He Lys 

115 120 125 

Gly Leu Met Pro Thr Leu He He Pro Phe Val Pro Leu Trp 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 240: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:240: 

Met He Ala Leu Gly Trp Ser Asn He Gly Ala Ala He Ala Pro Asp 

1 5 10 15 

Ala Ala Leu Ala Ser Val Ala Ala Ala lie He Met Val Leu Gly Gly 

20 25 30 

Asp Phe Thr Lys Thr Gly He Gly Val Ala Gin Ala Val Ala lie Pro 

35 40 45 

Leu Ala Val Ala Gly Leu Phe Leu Thr Met lie Val Arg Thr He Ser 
50 55 60 
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Val Gly Leu Val His Thr Ala Asp Ala Ala Ala Lys Lys Gly Asp Phe 
65 70 75 80 

Gly Ala Val Glu Arg Ala His Phe He Ala Leu Leu Phe Gin Gly Leu 

85 90 95 

Arg He Ala Leu Pro Ala Ala Leu Leu Leu Met Val Pro Thr Glu Thr 

100 105 no 

Val Gin Ser He Leu Ser Ala Met Pro Asp Trp Leu Lys Asp Gly Met 

115 120 125 

Ala He Gly Gly Gly Met Val Val Ala Val Gly Tyr Ala Met Val He 

130 135 140 

Asn Met Met Ala Thr Arg Glu Val Trp Pro Phe Phe Ala Leu Gly Phe 
145 150 155 160 

Val Leu Ala Ala Val Ser Asp He Thr Leu He Gly Phe Gly Ala He 

165 170 175 

Gly Val Ala He Ala Leu lie Tyr Leu His Leu Ser Lys Thr Gly Gly 

180 185 190 

Asn Gly Gly Gly Gly Ala Ala Thr Ser Asn Asp Pro He Gly Asp He 

195 200 205 

Leu Glu Asp Tyr 
210 

(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:241: 

Met Leu Asp Val Glu Ala He Arg Lys Asp Phe Pro He Leu Asp Gin 

1 5 10 15 

He Val Asn Asp Glu Pro Leu Val Tyr Leu Asp Asn Ala Ala Thr Thr 

20 30 
Gin Lys Pro Leu Val Val Leu Lys Ala lie Asn Ser Tyr Tyr Glu Gin 

35 40 45 

Asp Asn Ala Asn Val His Arg Gly Val His Thr Leu Ala Glu Arg Ala 

50 55 60 

Thr Ala Ser Leu 
65 
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(2) INFORMATION FOR SEQ ID NO: 2 42: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

Met Thr Lys Leu Leu Asn Lys Lys Trp Lys Val Lys lie Met Lys Gin 

15 10 15 

lie Leu Leu Val Cys Asn Ala Gly Met Ser Thr Ser Met Leu Val Lys 

20 25 30 

Lys Met Gin Gin Ser Ala Thr Glu Arg Gly He Glu He Ser He Gin 

35 40 45 

Ala Lys Ser Met Thr Glu Ala Lys Lys Asn He His Glu Ala Asp Val 

50 55 60 

He Leu He Gly Pro Gin He Arg Tyr Glu Leu Leu Ala Val Lys Glu 
65 70 75 80 

He Ala Gly Asn He Pro Val Asp Thr He Asp Met Arg Asp Tyr Gly 

85 90 95 

Met Met Asn Gly Ala Lys Val Leu Glu Gin Ala Leu Glu Trp He Gly 
100 105 HO 

Glu lie Arg 
115 

(2) INFORMATION FOR SEQ ID NO: 243: 

<i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 99 airino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(X,i) SEQUENCE DESCRIPTION : SEQ ID NO:243: 

Met Val Leu Met Gin Lys Met His Gly Lys Glu Leu He Thr Leu Asn 
15 10 15 
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Gin Thr Val Lys Trp Tyr Lys Val 
20 

Lys Pro Trp Tyr Tyr Leu Lys Ser 

35 40 
Trp Gin Gly Asn Tyr Tyr Leu Lys 

50 55 
Glu Trp Val Tyr Asp Ala Thr Tyr 
65 70 
Asp Gly Ser Tyr Ala Tyr Ser Thr 
85 

lie Gly Trp 
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Ser Gly Phe Met He Leu Leu Thr 
25 30 
Asp Gly Ser Tyr Ala Arg Asn Ala 
45 

Ser Asp Gly Lys Met Ala Val Asn 
60 

Gin Ala Trp Tyr Tyr Leu Thr Ser 

75 80 
Trp Gin Gly Asn Tyr Tyr Pro Lys 
90 95 



(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

Met Leu Thr Val His Gly Lys Glu He Thr He Leu Lys Ser Asp Gly 

15 10 15 

Lys Met Ala Val Asn Glu Trp Val Asp Gly Gly Arg Tyr Tyr Val Gly 

20 25 30 

Ala Asp Gly Val Trp Lys Glu Gly Gin Ala Ser Thr Ala Ser Pro Ser 

35 40 45 

Asn Asp Ser Asn Ser Glu Tyr Ser Cys Cys Phe Arg Lys Gly Lys Lys 
50 55 60 

Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 



Met Asn Asn Asn Phe Asn Asn Phe Asn Asn Met Asp Asp Leu Phe Asn 

15 10 15 

Gin Leu Met Gly Gly Met Arg Gly Tyr Ser Ser Glu Asn Arg Arg Tyr 

20 25 30 

Leu lie Asn Gly Arg Glu Val Thr Pro Glu Glu Phe Ala His Tyr Arg 

35 40 45 

Thr Thr Gly Gin Leu Pro Gly Asn Ala Glu Thr Asp Val Gin Met Pro 

50 55 60 

Gin Gin Ala Ser Gly Met Lys Gin Asp Gly Val Leu Ala Lys Leu Gly 
65 70 75 80 

Arg Asn Leu Thr Ala Glu Ala Arg Glu Gly Lys Leu Asp Pro Val lie 

85 90 95 

Gly Arg Asn Lys Glu lie Gin Glu Thr Ser Glu He Leu Ser Arg Arg 

100 105 110 

Thr Lys Asn Asn Pro Val Leu Val Gly Asp Ala Gly Val Gly Lys Thr 

115 120 125 

Ala Val Val Glu Gly Leu Ala Gin Ala He Val Asn Gly Asp Val Pro 

130 135 140 

Ala Ala He Lys Asn Lys Glu He He Ser He Asp He Ser Gly Leu 
145 150 155 160 

Glu Ala Gly Thr Gin Tyr Arg Gly Ser Phe Glu Glu Asn Val Gin Asn 

165 170 175 

Leu Val Asn Glu Val Lys Glu Ala Gly Asn He He Leu Phe Phe Asp 

180 185 190 

Glu He His Gin lie Leu Gly Ala Gly Ser Thr Cys Gly Asp Ser Gly 

195 200 205 

Ser Lys Gly Leu Ala Asp lie Leu Ser Gin Ser He Ser Leu Val Glu 
210 215 220 

Asn 

225 



(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 148 aniino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 

Met Thr lie Phe Pro Asp Asp Phe Leu Trp Gly Gly Ala Val Ala Ala 

1 5 10 is 

Asn Gin Val Glu Gly Ala Tyr Asn Glu Asp Gly Lys Gly Leu Ser Val 

20 25 30 

Gin Asp Val Leu Pro Lys Gly Gly Leu Gly Glu Ala Thr Glu Asn Pro 

35 40 45 

Thr Glu Asp Asn Leu Lys Leu lie Gly lie Asp Phe Tyr His Lys Tyr 

50 55 60 

Lys Glu Asp lie Ser Leu Phe Ser Glu Met Gly Phe Asn Val Phe Arg 
65 70 75 80 

Thr Ser lie Ala Trp Ser Arg He Phe Pro Lys Gly Asp Glu Glu Glu 

85 90 95 

Pro Asn Glu Ala Gly Leu Lys Tyr Tyr Asp Glu Leu Phe Asp Glu Leu 

100 105 HO 

His Ala His Gly He Glu Pro Leu Val Thr Leu Ser His Tyr Glu Thr 

115 120 125 

Pro Leu Tyr Leu Ala Arg Lys Tyr His Gly Trp He Asp Arg Lys Asn 

130 135 140 

Asp Ser Phe Leu 
145 

(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

Met He Glu Tyr Lys Asn Val Ala Leu Arg Tyr Thr Glu Lys Asp Val 

15 10 15 

Leu Arg Asp Val Asn Leu Gin He Glu Asp Gly Glu Phe Met Val Leu 

20 25 30 

Val Gly Pro Ser Gly Ser Gly Lys Thr Thr Met Leu Lys Met He Asn 

35 40 45 

Arg Leu Leu Glu Pro Thr Asp Gly Asn He Tyr Met Asp Gly Lys Arg 
50 55 60 
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lie Lys Asp Tyr Asp Glu Arg Glu Leu Arg Leu Ser Thr Gly Tyr Val 
65 70 75 80 

Leu Gin Ala lie Ala Leu Phe Pro Asn Leu Thr Val Ala Glu Asn lie 

85 90 95 

Ala Leu He Pro Glu Met Lys Gly Trp Ser Lys Glu Glu He Thr Lys 

100 105 110 

Lys Thr Glu Glu Leu Leu Ala Lys Val Gly Leu Pro Val Ala Glu Tyr 

115 120 125 

Gly His Arg Leu Pro Ser Glu Leu Ser Gly Gly Glu Gin Gin Arg Val 

130 135 140 

Gly He Val Arg Ala Met He Gly Gin Pro Lys He Phe Leu Met Asp 
145 150 155 160 

Glu Pro Phe Ser Ala Leu Asp Ala He Ser Arg Lys Gin Leu Gin Val 

165 170 175 

Leu Thr Lys Glu Leu His Lys Glu Phe Gly Met Thr Thr He Phe Val 

180 185 190 

Thr His Asp Thr Asp Glu Ala Leu Lys Leu Ala Asp Arg He Ala Val 

195 200 205 

Leu Gin Asp Gly Glu He Arg Gin Val Ala Asn Pro Glu Thr He Leu 

210 215 220 

Lys Val Pro Ala Thr Asp Phe Val Ala Asp Leu Phe Gly Gly Ser Val 
225 230 235 240 

His Asp 



(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

Met Pro Phe Val Pro Gly He Ma Leu Thr Asn Ala Val Arg Asp He 

15 10 « 

Met Thr Asn His He Asn Ser Gly Met Ser Lys Met Phe Glu Ser Leu 

20 25 30 

Leu He Thr Leu Ala Leu Gly Ala Gly Thr Ser Val Ala Leu Val Leu 
35 40 45 



202 



WO 97/37026 

Met Asn 
50 
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(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 249: 

Met Tyr Thr He Arg Met Leu Lys Met Gly Ser Glu Ala Ala Ala Lys 

15 10 15 

Ser Ala Gin Glu His Gly Leu Lys Ser Val Glu Val Thr Val Lys Gly 

20 25 30 

Pro Gly Ser Gly Arg Glu Ser Ala He Ser Cys Ala Cys Cys Arg Trp 
35 40 45 

Ser 



(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

Met He Glu Phe Glu Lys Pro Asn He Thr Lys He Asp Glu Asn Lys 

15 io 15 

Asp Tyr Gly Lys Leu Val He Glu Pro Leu Glu Arg Gly Tyr Gly Thr 

20 25 30 

Ala Leu Gly Asn Ser Leu Arg Arg Val Leu Leu Ala Ser Leu Pro Gly 

35 40 45 

Ala Ala Val Thr Ser lie Asn He Asp Gly Val L<su Kis Glu Phe Asp 
50 55 60 
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Thr Val Pro Gly Val Arg Glu Asp Val Met Gin He He Leu Asn He 
65 70 75 80 

Lys Gly He Ala Val Lys Ser Tyr Val Glu Asp Glu Lys He He Glu 

85 90 95 

Leu Asp Val Glu Gly Pro Ala Glu Val Thr Ala Gly Asp He Leu Thr 

100 105 110 

Asp Ser Asp He Glu He Val Asn Pro Asp His Tyr Leu Phe Thr He 

115 120 125 

Gly Glu Gly Ser Ser Leu Lys Ala Thr Met Thr Val Asn Ser Gly Arg 

130 135 I 40 

Gly Tyr Val Pro Ala Asp Glu Asn Lys Lys Asp Asn Ala Pro Val Gly 
145 .150 155 160 

Thr Leu Ala Val Asp Ser He Tyr Thr Pro Val Thr Lys Val Asn Tyr 

165 I 70 175 

Gin Val Glu Pro Ala Arg Val Gly Ser Asn Asp Gly Phe Asp Lys Leu 

180 185 19° 

Thr Leu Glu He Leu Thr Asn Gly Thr He He Pro Glu Asp Ala Leu 

195 200 205 

Gly Leu Ser Ala Arg He Leu Thr Glu His Leu Asp Leu Phe Thr Asn 

210 215 220 

Leu Thr Glu He Ala Lys Ser Thr Glu Val Met Lys Glu Ala Asp Thr 
225 230 235 240 

Glu Ser Asp Asp Arg lie Leu Asp Arg Thr He Glu Glu Leu Asp Leu 

245 250 255 

Ser Val Arg Ser Tyr Asn Cys Leu Lys Arg Ala Gly He Asn Thr Val 

260 265 270 

His Asp Leu Thr Glu Lys Ser Glu Ala Glu Met Met Lys Val Arg Asn 

275 280 285 

Leu Gly Arg Lys Ser Leu Glu Glu Val Lys Leu Lys Leu lie Asp Leu 

290 295 300 

Gly Leu Gly Leu Lys Asp Lys 
305 310 

(2) INFORMATION FOR SSQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(X.i) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 
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Met Asp He Glu Lys Gin Arg Gly He Ser Val Thr Ser Ser Val Met 

15 10 15 

Gin Phe Asp Tyr Asp Gly Lys Arg Val Asn He Leu Asp Thr Pro Gly 

20 25 30 

His Glu Asp Phe Ser Glu Asp Thr Tyr Arg Thr Leu Met Ala Val Asp 

35 40 45 

Ala Ala Val Met Val Val Asp Ser Ala Lys Gly He Glu Ala Gin Thr 

50 55 60 

Lys Lys Leu Phe Glu Val Val Lys His Arg Gly He Pro Val Phe Thr 
65 70 75 go 

Phe Met Asn Lys Leu Asp Arg Asp Gly Arg Glu Pro Leu Asp Leu Leu 

85 90 95 

Gin Glu Leu Glu Glu He Leu Gly He Ala Ser Tyr Pro Met Asn Trp 

100 105 no 

Pro He Gly Met Gly Lys Ala Phe Glu Gly Leu Tyr Asp Leu Tyr Asn 

115 120 125 

Gin Arg Leu Glu Leu Tyr Lys Gly Asp Glu Arg Phe Ala Ser Pro Arg 

130 135 140 

Arg Trp Arg Gin Thr Phe Trp 
145 150 

(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO;252: 

Met Ala Val lie Ala Glu Arg Lys Ala Tyr Tyr His Asp Pro Val Glu 

15 io 15 

Asp Ala He He Met Lys Arg Glu lie Asp Glu Gly 

20 25 

(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

Met Asn Glu Thr Trp Ser Arg Thr Ala Leu Thr Leu Leu Lys His Gly 

15 10 15 

Ser Glu Val Asn Leu Glu Arg Ala Leu Ser Val Asn Gly Arg Leu Gly 

20 25 30 

Gly His Val Val Thr Gly His lie Asp Gly Thr Gly Lys He Ser Ser 

35 40 45 

He Lys Lys Asp Asp Asn Ala Val Trp Tyr Gin He Asn Thr Gin Lys 

50 55 60 

Glu He Leu Asp Leu He Val Glu Lys Gly Ser He Thr He Asp Gly 
65 70 75 80 

He Ser Leu Thr Val Ala Lys Val Ser Lys Val Asn Phe Ser Val Ser 

85 90 95 

Val He Pro His Thr Leu Lys Gin* Thr He Leu Lys Ser Lys Gin Val 

100 105 110 

Gly Ser Thr Val Asn Leu Glu Asn Asp He Leu Gly Lys Tyr Val Gin 

115 120 125 

Lys Leu Met Asp Asn Ser Pro Lys Ser Glu lie Ser Lys Glu Leu Leu 

130 135 140 

Tyr Gin Asn Gly Phe 
145 

(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

Met Asn Trp Ser Ala Leu Glu Gin Gin lie Val Asp Glu Leu Lys He 

15 10 15 

Tyr lie Ala Pro Lys He Phe Gly Gly Ser Ala Lys Phe Pro Val Gly 

20 25 30 
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Gly Glu Gly lie Ser Leu Pro Asn Asp Ala He Arg Leu Lys Pro Tyr 

35 40 45 

Ala Phe Ser Gin Xaa Gly Xaa Asp Tyr Leu He Glu Ser Glu Val He 

50 55 60 

Tyr Pro Cys Ser Gin Glu 
65 70 

(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS ; single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

Met Val Leu Phe Leu Val Pro Gin Leu Arg Asn Ala Tyr Gly Thr Ala 

15 10 15 

Ala He Gly He He Cys Gly Leu Tyr Trp Ala Val Ser Ser Asn Met 

20 25 30 

Thr Val Glu Ala Thr Gin Arg Leu Thr Gly Gly Gly Gly Phe Ala He 

35 40 45 

Gly His Gin Gin Gin Phe Ala He Trp Phe Val Asp Lys Val Ala Gly 

50 55 60 

Arg Phe Gly Lys Lys Glu Glu Ser Leu Asp Asn Leu Lys Leu Pro Lys 
65 70 75 80 

Phe Leu Ser He Phe His Asp Thr Val Val Ala Ser Ala Thr Phe Asp 

85 90 95 

Ala Arg He Leu Arg Gly His Ser Phe Asn Leu Gly Ser Arg His Tyr 
100 105 HO 

Val 



(2) INFORMATION FOR SKQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS : 
{A) LENGTH; 204 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



207 



WO 97/37026 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

Met Leu He Asn Ser Leu Asn Gin Ser Tyr He Ser Ala Lys Arg He 

1 5 10 15 

Glu Glu Val Phe Ala Glu Ser Pro Glu Asn He His Ser Glu Leu Glu 

20 25 30 

Gin Lys Gin Val Thr Ser Gly Arg Val Leu Gin Val Gin Glu Leu Thr 

35 40 45 

Phe Thr Tyr Pro Asp Ala Ala Gin Pro Ser Leu Arg Asp He Ser Phe 

50 55 60 

Asp Met Thr Gin Gly Gin He Leu Gly He He Gly Gly Thr Gly Ser 
65 70 75 80 

Gly Lys Ser Ser Leu Val Gin Leu Leu Leu Gly Leu Tyr Pro Val Asp 

85 90 95 

Lys Gly Asn He Asp Leu Tyr Gin Asn Gly Arg Ser Pro Leu Asn Leu 

100 105 HO 

Glu Gin Trp Arg Ser Trp He Ala Tyr Val Pro Gin Lys Val Lys Leu 

H5 . 120 125 

Phe Lys Gly Thr He Arg Ser Asn Leu Thr Leu Gly Leu Asn Gin Glu 

130 135 140 

Val Ser Asp Gin Lys Leu Trp Gin Ala Leu Glu He Ala Gin Ala Lys 
145 150 155 160 

Asp Phe Val Ser Glu Lys Glu Gly Leu Leu Asp Ala Leu He Glu Ala 

165 170 175 

Gly Gly Arg Asn Phe Ser Gly Gly Gin Lys Gin Arg Leu Ser He Ala 

1B0 185 190 

Arg Ala Val Leu Arg Gin Ala Pro Phe He He Leu Asp Asp Ala Thr 

195 200 205 

Ser Ala Leu Asp Thr He Thr Glu Ser Lys Leu Leu Lys Ala He Arg 

210 215 220 

Glu Asn Phe Pre Asn Thr Ser Leu He Leu He Ser Gin Arg Thr Ser 
225 230 235 240 

Thr Leu Gin Met Ala Asp Gin He Leu Leu Leu Glu Lys Gly Glu Leu 

245 250 255 

Leu Ala Val Gly Lys His Asp Asp Leu Met Lys Ser Ser Gin Val Tyr 

260 265 270 

Arg Glu He Asn Ala Ser Gin His Gly Lys Glu Asp 
275 280 

(2) INFORMATION FOR SEQ ID NO:257: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

Met Lys Asp Pro Glu Ser Arg Leu Tyr He Leu Leu Lys Asp Gly Gin 

15 10 is 

Val He Gly Thr Cys Thr Val Asp Leu Ser Thr Asn Thr Asn Tyr Phe 

20 25 30 

Tyr Gly Leu Ala He Leu Glu Pro Glu Arg Gly Lys Gly Tyr Gly Ser 

35 40 45 

Tyr Leu Ala Lys Ser Leu Val Asn Gin Leu He Glu Gin Asn Asp Lys 

50 55 60 

Glu Phe Gin lie Ala Val Glu Asp Ser Asn Val Gly Ala Lys Arg Leu 
65 70 75 80 

Tyr Glu Lys lie Gly Phe Val Lys Gin Thr Gin Val Val Tyr Leu Asn 

85 90 95 

Glu Lys Gly Ala Arg Asp Ser Glu Val 
100 105 

(2) INFORMATION FOR SEQ ID NO:258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 

Met Phe Ala Leu Ser Ser Asn Lys Glu Leu Ala Glu Arg Val Ala Gin 

15 10 15 

Glu lie Gly He Glu Leu Gly Lys Ser Ser Val Arg Gin Phe Ser Asp 

20 25 30 

Gly Glu lie Gin Val Asn He Glu Glu Ser He Arg Gly Lys His Val 

35 40 45 

Phe He Leu Gin Ser Thr Ser Ser Pro Val Asn Asp Asn Leu Leu Glu 
50 55 60 
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He Leu He Met Val Asp Ala Leu Lys Arg Ala Ser Ala Glu Ser Val 
65 70 75 80 

Asn Val Val Met Pro Tyr Tyr Gly Tyr Ala Arg Gin Asp Arg Lys Ala 

85 90 95 

Arg Ala Arg Glu Pro He Thr Ser Lys Leu Val Ala Asn Met Leu Glu 

100 105 110 

Val Ala Gly Val Asp Arg Leu Leu Thr He Asp Leu His Ala Ala Gin 

115 120 125 

lie Gin Gly Phe Phe Asp He Pro Val Asp His Leu Met Gly Ala Pro 

130 135 140 

Leu He Ala Asp Tyr Phe Glu Arg Pro Gly Met Val Gly Ser Asp Tyr 
145 150 155 160 

Val Val Val Ser Pro Asp His Gly Gly Val Thr Arg Ala Arg Lys Leu 

165 170 175 

Ala Glu Phe Leu Lys Thr Ser He Ala He He Glu Lys Arg Arg Ser 

180 185 190 

Val Asp Lys Met Asn Thr Ser Glu Val Met Asn Thr He Gly Lys Val 

195 200 205 

Glu Gly Asn His Leu 
210 

(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 

(C> STRANDEDNESS: single 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:259: 

Met He Ser Phe Leu Leu Leu Leu Val Leu Val Trp Gly Phe Tyr He 

15 10 15 

Gly Tyr Arg Arg Gly Leu Leu Leu Gin Val Tyr Tyr Leu He Ser Ala 

20 25 30 

Met Ala Ser Ala Phe Met Ala Gly Gin Phe Tyr Lys Gly Leu Gly Glu 

35 40 45 

Gin Phe His Leu Leu Leu Pro Tyr Ala Asn Ser Gin Glu Gly Gin Gly 

50 55 60 

Thr Phe Phe Phe Pro Ser Asp Gin Leu Phe Gin Leu Asp Lys Val Phe 
65 70 75 80 



210 



WO 97/37026 PCT/US97/05306 

Tyr Ala Gly lie Gly Tyr Leu Leu Val Phe Gly lie Val Tyr Ser He 

85 90 95 

Gly Arg Leu Leu Gly Leu Leu Leu His Leu He Pro Ser Lys Lys Leu 

100 105 110 

Gly Gly Lys Leu Phe Gin Val Ser Ala Gly He Leu Ser Met Leu Val 

115 120 125 

Thr Leu Phe Val Leu Gin Met Ala Leu Thr He Leu Ala Thr He Pro 

130 135 140 

Met Ala Val He Gin Asn Pro Leu Glu Lys Ser He Val Ala Lys His 
145 150 155 160 

He He Gin Ser He Pro He Thr Thr Ser Trp Leu Lys Gin He Trp 

165 170 175 

Val Thr Asn Leu He Gly 
180 



(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 

(B) TYPE;, amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 260: 



Met Arg Ala Phe Ser Asp Phe Gly Thr Pro Met Leu He Gly Glu Gly 

15 10 15 

Tyr Arg Thr Phe Pro Val Leu He Tyr Thr Gin Phe He Ser Glu Val 

20 25 30 

Gly Gly Asn Ser Ala Phe Ala He Met Ala He He He Ala Leu Ala 

35 40 45 

He Phe Leu He Gin Lys His He Ala Asn Arg Tyr Ser Phe Ser Met 

50 55 60 

Asn Leu Leu His Pro lie Glu Pro Lys Lys Thr Thr Lys Gly Lys Met 
65 70 75 80 

Ala Ala He Tyr Ala Thr Val Tyr Gly He lie Phe He Ser Val Leu 

85 90 95 

Pro Gin lie Tyr Leu lie Tyr Thr Ser Phe Leu Lys Thr Ser Gly Met 

100 105 110 

Val Phe Val Lys Gly Tyr Ser Pro Asn Ser Tyr Lys Val Ala Phe Asn 
115 120 125 
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Arg Met Gly Ser Ala lie Phe Asn Thr He Arg He Pro Leu He Ala 

130 135 140 

Leu Val Leu Val Val Pro He Tyr Asp He Tyr Leu Leu Pro Ser Arg 

145 150 155 160 



(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 



Met Val Pro Tyr He Val Pro Gly 

1 5 
Ser Ser Phe Asn Thr Gly Leu Phe 
20 

Gly Thr Ala Phe He Leu lie Met 

35 40 
Tyr Thr He Arg Ser Ser Val Ala 

50 5 5 

He Glu Glu Ala Ala Gly Lys Leu 
65 70 



Thr Val Leu Gly He Ala Phe He 

10 15 
Gly Ser Gly Phe Leu Met He Thr 
25 30 
Ser Leu Ser Val Arg Arg Leu Pro 
45 

Ser Leu Gin Gin He Ala Pro Ser 
60 

Arg Lys 



(2) INFORMATION FOR SEQ ID NO: 262: 



(i) SEQUENCE CHARACTERISTICS: 
(AJ LENGTH: 48 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s i ng 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:262: 

Met He Phe lie Pro Met Ala Ala Tyr Ser He Ala Arg Asn Met Ser 

1 5 10 15 

Lys Arg Lys Ala Phe Thr He His Val Tyr Pro Leu Asn Pro Arg Asn 
20 25 30 
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Leu Arg Thr Phe Pro Ser His His Asp Ser Asp Tyr Gly Tyr Asp Glu 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

Met Lys Phe Arg Lys Leu Ala Cys Thr Val Leu Ala Gly Ala Ala Val 

15 10 15 

Leu Gly Leu Ala Ala Cys Gly Asn Ser Gly Gly Ser Lys Asp Ala Ala 

20 25 30 

Lys Ser Gly Gly Asp Gly Ala Lys Thr Glu lie Thr Trp Trp Ala Phe 

35 40 45 

Pro Val Phe Thr Gin Glu Lys Thr Gly Asp Gly Val Gly Thr Tyr Glu 

50 55 60 

Lys Ser lie He Gin Ala Phe 
65 70 

(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(Xi) SEQUENCE DESCRIPTION : SEQ ID MO: 264: 

Met Asn Ser Gin Gin Lys Pro Gin Glu He Lys Val Glu Glu Pro Val 

15 10 is 

Glu Ser Lys Glu Glu Thr Val Asn Gin Pro Val Glu Gin Pro Lys Val 

20 25 30 

Glu Thr Pro Ala Val Glu Lys Gin Thr Glu Pro Thr Glu Glu Pro Lys 
35 40 45 
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Val Glu Val Thr Ser lie Pro Gin Thr Thr Arg Tyr Glu Glu Asp Leu 

50 55 60 

Thr Lys Glu His Gly Thr Arg Glu Val Val Lys Glu Gly Lys Asn Gly 
65 70 75 80 

Ser Arg Thr Val Thr Thr Pro Tyr lie Leu Asn Ala Thr Asp Gly Thr 

85 90 95 

Thr Thr Glu Gly Thr Ser Thr Thr Asp Glu Ala Glu Met Glu Lys Glu 

100 105 110 

Val Val Arg Val Gly Thr Lys Pro Lys Glu Lys Leu Ala Pro Val Leu 

115 120 125 

Ser Leu Thr Ser Val Thr Asp Asn Ala Met Leu Arg Ser Ala Arg Leu 

130 135 140 

Thr Tyr His Leu Glu Asn Thr Asp Ser Val Asp Val Lys Lys lie His 
145 150 155 160 

Ala Glu lie Lys Asn Gly Asp Lys Val Val Lys Thr lie Asp Leu Ser 

165 170 175 

Lys Glu Arg Leu Ser Asp Ala Val Asp Gly Leu Glu Leu Tyr Lys Asp 

180 185 190 

Tyr Lys lie Val Thr Ser Met Thr Tyr Asp Arg Gly Asn Gly Glu Glu 

195 200 205 

Thr Ser Thr Leu Glu Glu Thr Pro Leu Arg Leu Asp Leu Lys Lys Val 

210 215 220 

Glu Leu Lys Asn lie Gly Ser Thr Asn Leu Val Lys Val Asn Glu Asp 
225 230 235 240 

Gly Thr Glu Val Ala Ser Asp Phe Leu Thr Ser Lys Pro Val Asp Val 

245 250 255 

Gin Asn Tyr Tyr Leu Lys Val Thr Ser Arg Asp Asn Lys Val Val Ser 
260 265 270 

Pro Pro Ser 

275 



(2) INFORMATION FOR SEQ ID NO: 265: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 65: 
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Met Ala Val Met Lys lie Glu Tyr Tyr Ser Gin Val Leu Asp Met Glu 

15 10 15 

Trp Gly Val Asn Val Leu Tyr Pro Asp Ala Asn Arg Val Glu Glu Pro 

20 25 30 

Glu Cys Glu Asp lie Pro Val Leu Tyr Leu Leu His Gly Met Ser Gly 

35 40 45 

Asn His Asn Ser Trp Leu Lys Arg Thr Asn Val Glu Arg Leu Leu Arg 

50 55 60 

Gly Thr Asn Leu lie Val Val Met Pro Asn Thr Ser Asn Gly Trp Tyr 
65 70 75 80 

Thr Asp Thr Gin Tyr Gly Phe Asp Tyr Tyr Thr Ala Leu Ala Glu Glu 

85 90 95 

Leu Pro Gin Val Pro Glu Thr Leu Leu Pro 
100 105 

(2) INFORMATION FOR SEQ ID NO:266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

{C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 66: 

Met Arg Leu Leu Ala Met Lys Met Lys Gin lie Ser Asp Thr Thr Leu 

15 10 15 

Lys lie Thr Met Ser Leu Glu Asp Leu Met Asp Arg Gly Met Glu He 

20 25 30 

Ala Asp Phe Leu Val Pro Gin Glu Lys Thr Glu Glu Phe Phe Tyr Ala 

35 40 45 

He Leu Gly 
50 

(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 

Met Leu Pro He He Cys Cys Gly Glu Ser Leu Glu Thr Tyr Glu Ala 

I 5 10 15 

Gly Lys Ala Ala Glu Phe Val Gly Ala Gin Val Ser Ala Ala Leu Ala 

20 25 30 

Gly Leu Thr Ala Glu Gin Val Ala Ala Ser Val He Ala Tyr Glu Pro 

35 40 45 

He Trp Ala He Gly Thr Gly Lys Ser Ala Ser Gin Asp Asp Ala Gin 

50 55 60 

Lys Met Cys Lys Val Val Arg Asp Val Val Ala Ala Asp Phe Gly Gin 
65 70 75 80 

Glu Val Ala Asp Lys Val Arg Val Gin Tyr Gly Gly Ser Val Lys Pro 

85 9° 95 

Glu Asn Val Ala Ser Tyr Met Ala Cys Pro Asp Val Asp Gly Ala Leu 

100 no 
Val Gly Gly Ala Ser Leu Glu Ala Glu Ser Phe Leu Ala Leu Leu Asp 

u5 120 125 

Phe Val Lys 
130 

(2) INFORMATION FOR SEQ ID NO: 2 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 

Met Phe val Glu Ala Cys Lys Ala Val Val Arg Ala Asn Glu Glu Tyr 

1 5 10 15 

Val Pro Pro Tyr Gly He Gly Gly Thr Leu Tyr Leu Arg Pro Leu Leu 

20 25 30 

He Gly Val Gly Asp lie He Gly Val Lys Pro Ala Glu Glu Tyr He 

35 40 45 

Phe Thr lie Phe Ala Met Pro Val Gly Asn Tyr Phe Lys Gly Gly Leu 
50 
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Val Pro Thr Asn Phe Leu He Gin Asp Glu Tyr Asp Arg Ala Ala Pro 
65 70 75 80 

Asn Gly Thr Gly Ala Ala Lys Val Gly Gly Asn Tyr Ala Ala Ser Leu 

85 90 95 

Leu Pro Gly Lys Met Ala Lys Ser Arg His Phe Ser Asp Val He Tyr 

100 105 110 

Leu Asp Pro Ser Thr His Thr Lys lie Glu Glu Val Gly Ser Ala Asn 

115 120 125 

Phe Phe Gly He Thr Ala Asp Asn Glu Phe Val Thr Pro Leu Ser Pro 

130 135 140 

Ser He Leu Pro Ser He Thr Lys Tyr Ser Leu Leu Tyr Leu Ala Glu 
145 150 155 160 

His Arg Leu Gly Leu Thr Pro He Glu Gly Asp Val Pro He Asp Asn 

165 170 175 

Leu Asp Arg Phe Val Lys Ala Gly Ala Cys Gly Thr Ala Ala Val He 

180 185 190 

Ser Pro He Gly Gly He Gin His Gly Asp Asp Phe His Val He Leu 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 269: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 



Met Val Met He Ser Met Leu Phe Tyr Ser Glu Thr Glu Val Gly Pro 

15 10 15 

Val Thr Arg Lys Leu Tyr Asn Glu Leu Thr Gly He Gin Phe Gly Asp 

20 25 30 

He Glu Ala Pro Glu Gly Trp He Val Lys Val Asp 
35 40 



(2) INFORMATION FOR SEQ ID NO:270: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 243 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270 : 

Met Thr Ala Ser Pro Leu Lys Lys Ser lie Lys Lys Lys Asn Arg Lys 

1 5 10 15 

Leu Thr Asn Lys Xaa Glu Lys His Met Ser Lys He He Gly He Asp 

20 25 30 

Leu Gly Thr Thr Asn Ser Ala Val Ala Val Leu Glu Gly Thr Glu Ser 

35 40 45 

Lys He He Ala Asn Pro Glu Gly Asn Arg Thr Thr Pro Ser Val Val 

50 55 60 

ser Phe Lys Asn Gly Glu He He Val Gly Asp Ala Ala Lys Arg Gin 
65 70 75 80 

Ala Val Thr Asn Pro Asp Thr Val He Ser He Lys Ser Lys Met Gly 

85 9° 95 

Thr Ser Glu Lys Val Ser Ala Asn Gly Lys Glu Tyr Thr Pro Gin Glu 

100 105 HO 

He Ser Ala Met He Leu Gin Tyr Leu Lys Gly Tyr Ala Glu Asp Tyr 

H5 120 125 

Leu Gly Glu Lys Val Thr Lys Ala Val He Thr Val Pro Ala Tyr Phe 

130 135 140 

Asn Asp Ala Gin Arg Gin Ala Thr Lys Asp Ala Gly Lys He Ala Gly 
145 150 155 160 

Leu Glu Val Xaa Arg He Val Asn Glu Pro Thr Ala Ala Ala Leu Ala 

165 170 175 

Tyr Gly Leu Asp Lys Thr Asp Lys Glu Glu Lys lie Leu Val Phe Asp 

180 185 I 90 

Leu Gly Gly Gly Thr Phe Asp Val Ser He Leu Glu Leu Gly Asp Gly 

195 200 205 

Val Phe Asp Vc.1 Leu Ser Thr Ala Gly Asp Asn Lys Leu Gly Gly Asp 

210 215 220 

Asp Phe Asp Gin Lys He He Asp His Leu Val Ala Glu Phe Lys Lys 
225 230 235 240 

Glu Asn Gly He Arg Leu Val Tyr 
245 



(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 eitd.no arid-3 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 
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(ii) MOLECULE TYPE: protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO:271: 

Met Lys Leu lie Val Asp Leu lie Tyr Glu Gly Gly Phe Lys Lys Met 

15 10 15 

Arg Gin Ser lie Ser Asn Thr Ala Glu Tyr Gly Asp Tyr Val Ser Gly 

20 25 30 

Pro Arg Val lie Thr Glu Gin Val Lys Glu Asn Met Lys Ala Val Leu 

35 40 45 

Ala Asp lie Gin Asn Gly Lys Phe Ala Asn Asp Phe Val Asn Asp Tyr 

50 55 60 

Lys Ala Gly Arg Pro Lys Leu Thr Ala Tyr Arg Glu Gin Ala Ala Asn 
65 70 75 80 

Leu Glu lie Glu Lys Val Gly Ala Glu Leu Arg Lys Ala Met Pro Phe 

85 90 95 

Val Gly Lys Asn Asp Asp Asp Ala Phe Lys lie Tyr Asn 
100 105 

(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

tC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 

Met Leu Leu Ser Ala Phe His Lys Tyr Glu Asn Gin Leu Asp Lys Val 

i 5 10 15 

Arg Phe lie Gly Leu His Thr Gly His Leu Gly Phe Tyr Thr Asp Tyr 

20 25 30 

Arg Asp Phe Glu Leu Asp Lys Leu Val Thr Asn Leu Gin Leu Asp Thr 

35 40 45 

Gly Ala Arg Val Ser Tyr Pro Val Leu Asn Val Lys Val Phe Leu Glu 

50 5b 60 

Asn Gly Glu Val Lys lie Phe Arg Ala Leu Asn Glu Ala Ser lie Arg 
65 70 75 80 
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Arg Ser Asp Arg Thr Met Val Ala 
85 

Phe Glu Arg Phe Arg Gly Asp Gly 
100 

Ser Thr Ala Tyr Asn 
115 
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Asp He Val He Asn Gly Val Pro 

90 95 
Leu Thr Val Ser Thr Pro Thr Gly 
105 110 



(2) INFORMATION FOR SEQ ID NO: 273: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 207 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 

Met Gly Ala Gin Leu Ala Arg Glu Phe Lys His Glu Ala Asp He Val 

1 5 10 15 

Val Gly Val Pro Asn Ser Ser Leu Ser Ala Ala Met Gly Phe Ala Glu 

20 25 30 

Glu Ser Gly Leu Pro Asn Glu Met Gly Leu He Lys Asn Gin Tyr Thr 

35 40 45 

Gin Arg Thr Phe He Gin Pro Thr Gin Glu Leu Arg Glu Gin Gly Val 

50 55 60 

Arg Met Lys Leu Ser Ala Val Ser Gly Val Val Lys Gly Lys Arg Val 
65 70 75 80 

Val Met Val Asp Asp Ser He Val Arg Gly Thr Thr Ser Arg Arg He 

85 90 95 

Val Gin Leu Leu Lys Glu Ala Gly Ala Thr Glu Val His Val Ala He 

100 105 HO 

Gly Ser Pro Ala Leu Ala Tyr Pro Cys Phe Tyr Gly He Asp He Gin 

115 120 125 

Thr Arg Gin Glu Leu He Ala Ala Asn His Thr Val Glu Glu Thr Arg 

130 135 140 

Gin He He Gly Ala Asp Ser Leu Thr Tyr Leu Ser He Asp Ser Leu 
145 150 155 160 

He Glu Ser lie Gly He Glu Thr Asp Ala Pro Asn Gly Gly Leu Cys 

165 170 175 

Val Ala Tyr Phe Asp Gly Asp Tyr Pro Thr Pro Leu Tyr Asp Tyr Glu 
180 185 I* 0 
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Glu Asp Tyx Arg Arg Ser Leu Glu Glu Lys Thr Ser Phe Tyr Lys 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 274: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:274: 

Met Lys lie Leu Val Thr Gly Phe Asn Pro Phe Gly Gly Glu Lys He 

15 10 15 

Asn Pro Ala Leu Glu Ala Val Lys Leu Leu Pro Ser Glu He Asn Gly 

20 25 30 

Ala Glu Val Arg Trp Val Glu He Pro Thr Val Phe Tyr Lys Ser Ser 

35 40 45 

Glu Val Leu Glu Ala Glu He Leu Arg Tyr Gin Pro Asp Ala Val Leu 

50 55 60 

Cys He Gly Gin Ala Gly Gly Arg Thr Gly Leu Thr Pro Glu Arg Val 
65 70 75 80 

Ala He Asn Gin Asp Asp Ala Arg He Pro Asp Asn Glu Gly Asn Gin 

85 90 95 

Pro He Asp Thr Pro He Arg He Asp Gly Ala Ser Ala Tyr Phe Ser 

100 105 no 

Ser Leu Pro lie Lys Ala Met Val Gin Ala Asn Lys Lys Glu Gly Leu 

115 120 125 

Thr Gly Ser Leu Phe Pro He Arg Ala Gly Thr Phe Val Cys Ser His 

130 135 140 

Leu Met Tyr Gin Ala Leu Tyr Leu Val Glu Lys Lys Phe Pro Tyr Val 
145 150 155 160 

Lys Ala Gly Phe Met His He Pro Tyr Met Met Glu Gin Val Val Asn 

165 170 175 

Arg Pro Thr Thr Pro Thr Met Ser Leu Val Asp He Arg Arg Gly He 

180 185 190 

Glu Ala Ala He Gly Ala Met H* Glu His Gly Asp Gin Glu Leu Thr 

195 200 205 

Leu Val Gly Gly Glu He His 
210 215 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 

Met Leu Val Pro Lys Arg Val Lys His Arg Arg Glu Phe Arg Gly Lys 

15 10 15 

Met Arg Gly Glu Ala Lys Gly Gly Lys Glu Val Ala Phe Gly Glu Tyr 

20 25 30 

Gly Leu Gin Ala Thr Thr Ser His Trp He Thr Asn Arg Gin He Glu 

35 40 45 

Ala Ala Arg He Ala Met Thr Arg Tyr Met Lys Arg Gly Gly Lys Val 

50 55 60 

Trp He Lys . He Phe Pro His Lys Ser Tyr Thr Ala Lys Ala He Gly 
65 70 75 80 

Val Arg Met Gly Ser Gly Lys Gly Ala Pro Glu Gly Trp Val Ala Pro 

85 90 95 

Val Lys Arg Gly Lys Val Met Phe Glu He Ala Gly Val Ser Glu Glu 

100 105 HO 

He Ala Arg Glu Ala Leu Arg Leu Ala Ser His Lys Leu Pro Val 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 205 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276 : 

Met Gly Trp Trp Arg Glu Thr He Asp He Val Lys Glu Asn Asp Pro 
15 10 15 
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Ala Ala Arg Thr Thr Leu Glu Val Leu Leu Thr Tyr Pro Gly Val Lys 

20 25 30 

Ala Leu Ala Ala His Arg Leu Ser His Phe Leu Tip Lys Tyr Asp Phe 

35 40 45 

Lys Leu Leu Ala Arg Met His Ser Gin Phe Tip Arg Phe Trp Thr Gin 

50 55 60 

He Glu He His Pro Gly Ala Gin lie Asp Ser Gly Val Phe He Asp 
65 70 75 80 

His Gly Ser Gly Leu Val He Gly Glu Thr Ala He Val Glu Lys Gly 

85 90 95 

Val Leu Leu Tyr His Gly Val Thr Leu Gly Gly Thr Gly Lys Asp Cys 

100 105 110 

Gly Lys Arg His Pro Thr Val Arg Lys Gly Ala Leu He Ser Ala His 

115 120 125 

Ala Gin Val He Gly Pro Val Glu lie Gly Glu Asn Ala Lys Val Gly 

130 135 140 

Ala Ala Ala Val Val Val Ala Asp Val Pro Ser Asp Val Thr Val Val 
145 150 155 160 

Gly He Pro Ala Lys He Val Arg Leu His Gly Lys Lys Asp Glu Pro 

165 170 175 

Val He His Glu Val Glu Glu Lys Arg Glu Tyr Tyr Val Asn Lys Leu 

180 185 190 

Glu Gin Ala Lys Asp Ala Ser His Arg Ser Ser Gly Leu 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 277: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

tii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 



Met Leu Phe Tyr Leu Leu Arg Asp 

1 5 
Thr Gin Phe He Pro Ser Lys Lea 
20 

Ser Asp Val Asn Gin Gin Leu Ser 
35 40 



Gly Lys Gly Leu Arg Asn Tyr Leu 

10 15 
Lys Glu Pro Val Gly Gin Val Leu 
25 30 
Asn Tyr Val Arg Gly Gin Val Thr 
45 
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Val Ala He He Val Ala Val Met Phe He He Phe Phe Lys He He 

50 55 60 

Gly Leu Arg Tyr Ala Val Thr Leu Gly Val Thr Ala Gly He Leu Asn 
65 70 75 80 

Leu Val Pro Tyr Leu Gly Ser Phe Leu Ala Met Leu Pro Ala Leu Val 

85 90 95 

Leu Gly Leu He Ala Gly Pro Val Met Leu Leu Lys Val Val lie Val 

100 105 HO 

Phe He Val Glu Gin Thr He Glu Gly Arg Phe Val Ser Pro Leu He 

115 120 125 

Leu Gly Ser Gin Leu Asn He His Pro He Asn Val Leu Phe Val Leu 

130 135 140 

Leu Thr Ser Gly Ser Met Phe Gly He Trp Gly Val Leu Leu Gly He 
145 150 155 160 

Pro Val Tyr Ala Ser Ala Lys Val Val He Ser Ala He Phe Glu Trp 

165 170 175 

Tyr Lys Val Val Ser Gly Leu Tyr Glu Leu Glu Gly Glu Glu Val Lys 
180 185 190 

Ser Glu Gin 
195 



(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

Met Thr Lys Gin Val Pro Lys Phe Thr Lys Asp Thr Ala Gin Leu Tyr 

1 5 10 15 

Thr Cys Lys Trp Leu Leu Tyr Asn Lys Val Thr Lys Met Tyr Asp His 

20 25 30 

Thr Val Val Asn His Ser Val Arg Glu Tyr He Thr Asp Ser He Ser 

35 40 45 

Thr Asn Thr He Glu Glu Thr Gly 
50 55 

(2) INFORMATION FOR SEQ 10 NO: 279: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 



Met Lys Lys Arg Ala lie Val Ala Val He Val Leu Leu Leu He Gly 

15 10 15 

Leu Asp Gin Leu Val Lys Ser Tyr He Val Gin Gin He Pro Leu Gly 

20 25 30 

Glu Val Arg Ser Trp lie Pro Asn Phe Val Ser Leu Thr Tyr Leu Gin 

35 40 45 

Asn Arg Gly Ala Ala Phe Ser He Leu Gin Asp Gin Gin Leu Leu Phe 

50 55 60 

Ala Val He Thr Leu Val Val Val He Gly Ala He Trp Tyr Leu His 
65 70 75 80 

Lys His Met Glu Asp Ser Phe Trp Met Val Leu Gly Leu Thr Leu He 

85 90 95 

lie Ala Gly Gly Leu Gly Asn Phe He Asp Arg Val Ser Gin Gly Phe 

100 105 110 

Val Val Asp Met Phe His Leu Asp Phe lie Asn Phe Ala He Phe Asn 

115 120 125 

Val Ala Asp Asn Tyr Leu Thr Val Gly Val lie lie Leu Leu lie Ala 

130 135 140 

Met Leu Lys Glu Glu He Asn Gly Asn 
145 150 



(2) INFORMATION FOR SEQ ID NO; 280: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:280: 
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Met Gly Ala Tyr Tyr Leu Val Met Gin Ser Leu Ser Tyr Leu Glu Tyr 

15 10 15 

Glu Gin Gly He Gin Ser Thr Thr Val Arg His Leu He Leu Val Phe 

20 25 30 

Tyr Leu Leu Phe Phe Met Gly Gly He Lys Lys Leu Asp Thr Tyr Leu 

35 40 45 

Lys Glu Lys Leu Gin Glu Glu Leu Asn Gin Glu Gin Thr Leu Arg Tyr 

50 55 60 

Arg Asp Met Glu Arg Tyr Ser Arg His He Glu Glu Leu Tyr Lys Glu 
65 70 75 80 

He Arg Ser Phe Arg His Asp Tyr Thr Asn Leu Leu Thr Thr Tyr Val 

85 90 95 

Trp Ala Leu Lys Arg Arg He Trp Ser Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH : 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 

Met His Val Arg Met He Pro Lys Ser Thr Pro Asp Thr Lys Phe Ala 

1 5 10 15 

Asp Val Ala Thr His Gin Pro Glu Tyr Ser Arg Asp Asn Val Ala Gly 

20 25 30 

Thr He Val Gly Phe Trp Thr Pro Glu He Phe His Gly Val Ser Val 

35 40 45 

Ala Gly Tyr His Leu His Phe He Ser Asp Asp Leu Thr Phe Gly Gly 

50 55 60 

His Val Met Asp Phe Val He Lys Glu Gly He He Glu Val Gly Ala 
65 70 75 80 

Val Asp Gin Leu Asp Gin Arg Phe Pro Val Gin Asp Arg Gin Tyr Leu 

85 90 95 

Phe Ala Lys Phe Asn Val Asp Glu Met Lys Lys Asp He Glu Lys Ala 
100 105 HO 

Glu 
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(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 

Met Ala Val Ala Lys Gly Lys Leu Thr lie lie Ala His Val Ala Cys 

15 10 as 

Asn Asn Thr Lys Asp Ser Met Glu Leu Ala Arg His Ala Glu Ser Leu 

20 25 30 

Gly Val Asp Ala He Ala Thr Asp Ser Thr Asn Leu Phe Pro Leu Ala 

35 40 45 

Arg He Thr Gin Leu Pro Asn Thr Gly Thr He Ser Val Leu Gin Leu 

50 55 60 

Gin Thr Gin Thr Thr 
65 

(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

Met Gly Glu Thr Gin He He Asp Gly Leu Asp Pro Glu Tyr Lys Lys 

1 5 10 is 

Arg Phe Met His His Tyr Asn Phe Pro Gin Tyr Ser Val Gly Glu Thr 

20 25 30 

Gly Arg Tyr Gly Ala Pro Gly Arg Arg Glu He Gly His Gly Ala Leu 

35 4C 45 

Gly Glu Arg Ala Leu Ala Gin Val Leu Pro Ser Leu Glu Glu Phe Pro 
50 55 60 
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Tyr Ala lie Arg Leu Val Ala Glu Val Leu Glu Ser Asn Gly Ser Ser 
65 70 75 80 

Ser Gin Ala Ser He Cys Ala Gly Thr Leu Ala Leu Met Ala Gly Gly 

85 90 95 

Val Pro He Lys Ala Pro Val Ala Gly He Ala Met Gly Leu He Ser 

100 105 110 

Asp Gly Asn Asn Tyr Thr Val Leu Thr Asp He Gin Gly Leu Glu Asp 

115 120 125 

His Phe Gly Asp Met Asp Phe Lys Val Ala Gly Thr Arg Asp Gly He 

130 135 140 

Thr Ala Leu Gin Met Asp He Lys He Gin Gly lie Thr Ala Glu He 
145 150 155 160 

Leu Thr Glu Ala Leu Ala Gin Ala Lys Lys Ala Arg Phe Glu He Leu 

165 170 175 

Asp Val He Glu Ala Thr He Pro Glu Val Arg Pro Glu Leu Ala Pro 

180 185 190 

Thr Ala Pro Lys He Asp Thr He Lys He Asp Val Asp Lys He Lys 

195 200 205 

He Val He Gly Lys Gly Gly Glu Thr He Asp Lys He lie Ala Glu 

210 215 220 

Thr Gly Val Lys He Asp He Asp Glu Glu Xaa Asn Val Phe Tyr Leu 
225 230 235 240 

Leu Leu Val Asp Gin Asn Ala lie Asn Pro Cys Pro Lys Lys Leu Leu 

245 250 255 

Leu Val Trp Phe Arg Glu Pro Lys Trp Met Lys Phe Thr Val Leu Asn 

260 265 270 

Arg Ser Tyr Arg Glu Phe Gly Ala Phe Val Thr Leu 
275 280 

(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 

Met lie Asn Asn Val Val Leu Val Gly Arg Met Thr Arg Asp Ala Glu 
x 5 10 15 
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Leu Arg Tyr Thr Pro Ser Asn Val Ala Val Ala Thr Phe Thr Leu Ala 

20 25 30 

Val Asn Arg Thr Phe Lys Ser Gin Asn Gly Glu Arg Glu Ala Asp Phe 

35 40 45 

lie Asn Val Val Met Trp Arg Gin Gin Ala Glu Asn Leu Ala Asn Trp 

50 55 60 

Ala Lys Lys Gly Ser Leu lie Gly Val Thr Gly Arg lie Gin Thr Arg 
65 70 75 80 

Ser Tyr Asp Asn Gin Gin Gly Gin Arg Val Tyr Val Thr Glu Val Val 

85 90 95 

Ala Glu Asn Phe Gin Met Leu Glu Ser Arg Ser Val Arg Glu Gly His 

100 105 110 

Thr Gly Gly Ala Tyr Ser Ala Pro Thr Ala Asn Tyr Ser Ala Pro Thr 

115 120 125 

Asn Ser Val Pro Asp Phe Ser Arg Asn Glu Asn Pro Phe Gly Ala Thr 

130 135 140 

Asn Pro Leu Asp lie Ser Xaa Asp Asp Leu Pro Phe 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

Met Asp Leu Val Arg lie Leu Lys Glu Pro Arg Asn Ala Leu Val Lys 

15 10 15 

Gin Tyr Gin Thr Leu Leu Ser Tyr Asp Asp Val Glu Leu Glu Phe Asp 

20 25 30 

Asp Glu Ala Leu Gin Glu lie Ala Asn Lys Ala lie Glu Arg Lys Thr 

35 40 45 

Gly Ala Arg Gly Leu Arg Ser lie He Glu Glu Thr Met Leu Asp Val 

50 55 60 

Met Phe Glu Val Pro Ser Gin Glu Asn Val Lys Leu Val Arg He Thr 
65 70 75 80 

Lys Glu Thr Val Asp Gly Thr Asp Lys Pro He Leu Glu Thr Ala 
85 90 95 
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{2) INFORMATION FOR SEQ ID NO: 286: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286 : 

Met Thr Glu Ala Gly Tyr Val Gly Glu Asp Val Glu Asn lie Leu Leu 

x 5 10 15 

Lys Leu Leu Gin Val Ala Asp Phe Asn He Glu Arg Ala Glu Arg Gly 

20 25 30 

He He Tyr Val Asp Glu He Asp Lys He Ala Lys Lys Ser Glu Asn 

35 40 45 

Val Ser He Thr Arg Asp Val Ser Gly Glu Gly Val Gin Gin Ala Leu 

50 55 60 

Leu Lys lie lie Glu Gly Thr Val Ala Ser Val Pro Pro Gin Gly Gly 
65 70 75 80 

Arg Lys His Pro Gin Gin Val Asp Asp Ser Ser Gly Tyr Lys Lys Tyr 

85 90 95 

Pro Leu His Arg Gly Trp Cys Phe 
100 



(2) INFORMATION FOR SEQ ID NO: 2 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 

Met Trp Pro Arg Glu Ser Arg Ser Cys Arg Leu Thr Lys Val Lys He 
5 10 15 
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Cys Gly Leu Ser Thr Lys Glu Ala Val Glu Thr Ala Val Ser Ala Gly 

20 25 30 

Ala Asp Tyr He Gly Phe Val Phe Ala Pro Ser Lys Arg Gin Val Thr 

35 40 45 

Leu Glu Glu Ala Ala Glu Leu Ala Lys Leu He Pro Ala Asp Val Lys 

50 55 60 

Lys Val Gly Val Phe Val Ser Pro Ser Arg Val Glu Leu Leu Glu Ala 
65 70 75 80 

He Asp Lys Val Gly Leu Asp Leu Val Gin Val His Gly Gin Val Ala 

85 90 95 

Asp Asp Leu Phe Glu Asn Leu Pro Cys Ala Ser lie Gin Ala Val Gin 

100 105 no 

Val Asp Gly Asn Gly His Val Pro Asn Ser Gin Ala Asp Tyr Leu Leu 

115 120 125 

Phe Asp Ala Pro Val Ala Gly Ser Gly Gin Ser Phe Asp Trp Gly Gin 

130 135 140 

Leu Asp Thr Thr Gly Leu Ala Gin Pro Phe Phe He Ala Gly Gly Leu 
145 150 155 160 

Asn Glu Asp Asn Val Val Lys Ala He Gin His Phe Thr Pro Tyr Ala 

165 170 175 

Val Asp Val Ser Ser Gly Val Glu Thr Asp Gly Gin Lys Asp His Glu 

180 185 190 

Lys He Arg Arg Phe He Glu Arg Val Lys His Gly He Ser Gly Thr 
195 200 205 

Lys 



(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 

Met Asn Cys Glu Ala Val Ala Leu Gly Ser Phe Cys Glu Leu Lys Ser 

15 10 15 

Arg Arg Gly Lys Lys Gin Lys Gly Glu He Leu Met Ala Val He Ser 
20 25 30 
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Met Lys Gin Leu Leu Glu Ala Gly Val His Phe Gly His Gin Thr Arg 

35 40 45 

Arg Trp Asn Pro Lys Met Ala Lys Tyr lie Phe Thr Glu Arg Asn Gly 

50 55 60 

lie His Val He Asp Leu Gin Gin Thr Val Lys Tyr Ala Asp Gin Ala 
65 70 75 80 

Tyr Xaa Phe Met Arg Asp Ala Ala Ala Asn Asp Ala Val Val Leu Phe 

B5 90 95 

Val Gly Thr Lys Lys Thr Ser Ser 
100 

(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:289: 

Met Val Asp Thr Asn Thr Asp Pro Asp Asp He Asp Val He He Pro 

15 10 15 

Ala Asn Asp Asp Ala He Arg Ala Val Lys Leu He Thr Ala Lys Leu 

20 25 30 

Ala Asp Ala He lie Glu Gly Arg Gin Gly Glu Asp Ala Val Ala Val 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 
Met Glu Glu Leu Gly Gin Ser lyr Gly Tyr Leu Leu Tyr Arg Thr Glu 
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Thr Asn Trp Asp Ala Glu Glu Glu Arg Leu Arg He He Asp Gly Arg 

20 25 30 

Asp Arg Ala Gin Leu Tyr Val Asp Gly Gin Trp Val Lys Thr Gin Tyr 

35 40 45 

Gin Thr Glu He Gly Glu Asp He Phe Tyr Gin Gly Lys Lys Lys Gly 

50 55 60 

Leu Ser Arg Leu Asp lie Leu He Glu Asn Met Gly Arg Val Asn Tyr 
65 70 75 so 

Gly His Lys Phe Leu Ala Asp Thr Gin Arg Lys Gly He Arg Thr Gly 

85 90 95 

Val Cys Lys Asp Leu His Phe Leu Leu Asn Trp Lys His Tyr Pro Leu 

100 105 no 

Pro Leu Asp Asn Pro Glu Lys lie Asp Phe Ser Lys Gly Trp Thr Gin 

115 120 125 

Gly Gin Pro Ala Phe Tyr Ala Tyr Asp Phe Thr Val Glu 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 291 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 68 amino acids 
CB) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 

Met Gly Lys Glu Lys Val Trp Leu Pro Arg Lys Gin Gly Leu Pro Gly 

1 5 10 15 

Leu Ala Tyr Tyr Val lie Glu Val Ala His Lys Glu Glu Leu Leu Thr 

20 25 30 

lie Ala Gin Arg Ala Gin Glu Val Asp Val Pro He Lys Trp Met Thr 

35 40 45 

Ser Ser Gin Leu Glu lie Thr Asp Ser Asp Gly lie Val Thr Cys lie 

50 55 60 

Arg Leu Ala Arg 
65 



(2) INFORMATION FOR SEQ ID NO: 292: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 65 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:292: 



Met Asn Lys Lys Gin Trp Leu Gly 

1 5 
Val Gly Leu Ala Ala Cys Gly Asn 
20 

Ser Ser Asp Val Lys Thr Lys Ala 

35 40 
Val Asp Asp Lys Ser Phe Asn Gin 
50 55 

Gly 
65 



Leu Gly Leu Val Ala Val Ala Ala 

10 15 
Arg Ser Ser Arg Asn Ala Ala Ser 
25 30 
Ala He Val Thr Asp Thr Gly Gly 
45 

Ser Ala Trp Glu Val Ala Gly Leu 
60 



(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 

Met Ser Gin Trp Asp Arg Lys Leu Asp Ala Arg Leu Ala Gin Ala Val 

! 5 10 15 

Val Ser He Asn Ala Phe Lys Gly Val Glu Phe Gly Leu Gly Phe Glu 

20 25 30 

Ala Gly Tyr Arg Lys Gly Ser Gin Val Met Asp Glu He Leu Trp Ser 

35 40 45 

Lys Glu Asp Gly Tyr Thr Arg Arg Thr Asn Asn Leu Gly Gly Phe Glu 

50 55 60 

Gly Gly Met Thr Asn Gly Gin Pro He Val Val Arg Gly Val Met Lys 
65 70 75 80 

Pro He Pro Thr Leu Tyr Lys Pro Leu Met Ser Val Asp He Glu Thr 
85 90 95 
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His Glu Pro Tyr Lys Ala Thr Val Glu Arg Ser Asp Pro Thr Ala Leu 

100 105 110 

Pro Ala Ala Gly Met Val Met Glu Ala Val Val Ala Thr Val Leu Ala 

115 120 125 

Gin Glu lie Leu Glu Lys Phe Ser Ser Asp Asn Leu Glu Glu Leu Lys 

130 135 140 

Glu Ala Val Ala Lys His Arg Asp Tyr Thr Lys Asn Tyr 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 294: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 



Met Glu Asp Ser Asn Ser Phe Met 

1 5 
Leu Lys Glu Ala Gly Phe Thr Lys 
20 

Leu Lys Asp Pro Ser Gin Thr Lys 

35 40 
Asn Lys Lys Tyr Thr Val Leu Ser 
50 55 



Leu Ser His Gin Met Glu Thr He 

10 15 
Ala Val Ser Tyr Phe He Leu Glu 
25 30 
Val Val Thr Glu Glu Leu Gin Lys 
45 



(2) INFORMATION FOR SEQ ID NO: 295: 



(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 



Met Gly Lys Lys Arg Trp Ala Arg Asn Gly Phe Glu Ser Asn Asp Ala 
15 10 15 
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Ser Tyr Ala Gin Val Val Ser Leu Tyr Asp Asp Thr Ser He Ser Val 

20 25 30 

Ser Asn Asn Glu Thr Asp Lys Val Leu Ala Gly Ser Leu Tyr Thr Asp 

35 40 45 

Thr Asn Glu Gin Gly Leu Thr He Pro Ser Ser Phe Thr Lys Lys Leu 
50 55 60 

Glu 
65 

(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 138 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 

Met Ala Met Glu Ser Gly Ala Thr Ala Val Val Ala Glu Arg Gly Gin 

15 10 15 

Glu Arg He Thr Lys Val Arg Glu He Leu Gly Gly Gly Ala Asp Ala 

20 25 30 

Ala Leu Glu Cys Val Gly Thr Glu Ala Ala lie Glu Gin Ala Leu Gly 

35 40 45 

Val Leu His Asn Gly Gly Arg Met Gly Phe Val Gly Val Pro His Tyr 

50 55 60 

Asn Asn Arg Ala Leu Gly Ser Thr Phe Met Gin Asn He Ser Val Ala 
65 70 75 80 

Gly Gly Ala Ala Ser Ala Thr Thr Tyr Asp Lys Gin Phe Leu Leu Lys 

85 90 95 

Ala Val Leu Asp Gly Asp He Asn Pro Gly Arg Val Phe Thr Ser Ser 

100 105 HO 

Tyr Lys Leu Glu Asp He Asp Gin Ala Tyr Lys Asp Met Asp Glu Arg 

U 5 120 125 

Lys Thr He Lys Ser Met He Val He Glu 
130 135 

(2) INFORMATION FOR SEQ ID NO: 297: 



(i) SEQUENCE CHARACTERISTICS: 
( fi) LENGTH: 130 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNE5S : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 297: 



Met Lys Gin Val Val Asp Tyr Phe Leu Ser Gin Gly Met Asp Arg lie 

15 10 is 

Gly He Leu Thr Gly Leu Glu Glu Thr Thr Asp Gin Glu Glu He He 

20 25 30 

Gin Asp Lys Arg Leu Glu Asn Phe Lys Asn Tyr Ser Gin Ala Arg Gly 

35 40 45 

lie Tyr His Asp Glu Leu Val Phe Gin Gly Arg Phe Thr Ala Gin Ser 

50 55 60 

Gly Tyr Asp Leu Met Lys Glu Ala He Gin Ser Leu Gly Asp Gin Leu 
65 70 75 80 

Pro Pro Ala Phe Phe Ala Ala Ser Asp Ser Leu Ala He Gly Ala Leu 

85 90 95 

Arg Ala Leu Gin Glu Ala Gly He Ser Leu Pro Asp Arg Val Ser Leu 

100 105 no 

He Phe Leu Leu Thr Thr Leu Ser Leu Thr Lys Gin Val Tyr Pro Pro 
115 120 125 

Leu Ser 
130 



(2) INFORMATION FOR SEQ ID NO: 298: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:298: 



Met Ala Lys Val Thr lie Met. Leu Ala Cys Ala Ala Gly Met Ser Thr 

15 10 is 

Ser Leu Leu Vel Thr Ly3 Met Gin Lys Ala Ala Glu Asp Lys Gly Leu 
20 25 30 
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Asp Ala Glu He Phe Ala Val Pro Ala Pro Glu Ala Glu Glu He Val 

35 40 45 

Ala Thr Lys Glu Val Asn Val Leu Leu Leu Gly Pro Gin Val Arg Tyr 

50 55 60 

Leu Leu Gly Asp Phe Gin Glu Lys Leu Lys Asp Arg Gin He Pro Val 
65 70 75 80 

Ala Val He Pro Met Thr Asp Tyr Gly Met Met Asn Gly Ser Lys Val 

85 90 95 

Leu Asp Leu Ala Glu Ser Leu Leu Asp 
100 105 

(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

{ii> MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 

Met His Gly Ala Gly Val Asn Glu Pro Val Ala Glu Leu Ser Val Ala 

1 5 10 15 

Glu Gin Leu Leu Glu Ala Gly Ala Asp Val He Leu Val Pro Ala Val 

20 25 30 

Gly Thr Val Pro Ala Phe His Asp Gin Glu Leu Arg Glu Val Val Asp 

35 40 45 

Leu Val His Ser Lys Gly Gly Leu Val Leu Ser Ala He Gly Thr Ser 

50 55 60 

Gin Glu Thr Ser Asp Thr Asp Thr He Lys Glu He Ala Leu Arg Asn 
65 70 75 80 

Lys lie Cys Gly Val Asp He Gin His He Gly Asp Ala Gly Tyr Gly 

85 90 95 

Gly Leu Ala Thr Val Asp Asn lie Tyr Ala Leu Ser Lys Ala He Arg 

100 105 HO 

Gly Val Arg His Thr Val Ser Arg Leu Ala Arg Ser Val Asn Arg 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 3 00: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 6 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 

Met Ser Ala lie Asn Leu Ala Ser Val Pro Lys Glu Ser Leu Thr Gin 

15 10 15 

Val Leu Pro Arg Asp Leu His Ala Glu Tyr Phe Ala Val Leu Ala Ser 

20 25 30 

He Ala Thr Ser He Glu Arg Met Ala Thr Glu He Arg Gly Leu Gin 

35 40 45 

Lys Ser Glu Gin Arg Glu Val Glu Glu Phe Phe Ala Lys Gly Gin Lys 

50 55 60 

Gly Ser Ser Ala Met Pro His Lys Arg Asn Pro He Gly Ser Glu Asn 
65 70 75 80 

Met Thr Gly Leu Ala Arg Val He Arg Gly His Met He Thr Ala Tyr 

85 90 95 

Glu Asn Val Ala Leu Trp His Glu Arg Asp He Ser His Ser Ser Ala 

100 105 110 

Glu Arg He He Thr Pro Asp Thr Thr He Leu He Asp Tyr Met Leu 

115 120 125 

Asn Arg Phe Gly Asn He Val Lys Asn Leu Thr Val Phe Pro Glu Asn 

130 135 140 

Met He Arg Asn Met Asn Ser Thr Phe Gly Leu lie Phe Ser Gin Arg 
145 150 155 160 

Ala Met Leu Thr Leu He Glu Lys Gly Met Thr Arg Glu Gin Ala Tyr 

165 170 175 

Asp Leu Val Gin Pro Lys Thr Ala Tyr Ser Trp Asp Asn Gin Val Asp 

180 185 190 

Phe Lys Pro Leu Leu Glu Ala Asp Ser Glu Val Thr Ser Arg Leu Thr 

195 200 205 

Gin Glu Glu lie Asp Glu lie Phe Asn Pro Val Tyr Tyr Thr Lys Arg 

210 215 220 

Val Asp Asp He Phe Glu Arg Leu Gly Leu Gly Asp 
225 23C 235 

(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 

Met Leu Asn Leu Thr His Val Thr Leu Lys Thr Arg Gin Val lie Leu 

15 10 15 

Gin Asp Ala Asp Phe Thr Phe Lys Lys Gly Arg lie Tyr Gly Leu Leu 

20 25 30 

Ala He Asn Gly Ser Gly Lys Thr Thr Leu Phe Arg Ala Met Ser Lys 

35 40 45 

Leu Leu Pro Leu Ser Ser Gly His He Ala Val Pro Pro Ser Leu Phe 

50 55 60 

Tyr Tyr Glu Ser Val Glu Trp Leu Asp Gly Asn Leu Ser Gly Met Asp 
65 70 75 80 

Tyr Leu Arg Leu He Lys Lys His Leu Glu Val Arg Pro Lys Leu Glu 
85 90 95 

Arg 



(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 

Met Tyr Thr Leu Gly Gin Pro Asn Glu Leu Ala Ala Glu Phe Leu Asn 

15 10 15 

Phe Val Leu Ser Asp Glu Thr Gin Glu Gly He Val Lys Gly Leu Lys 

20 25 30 

Tyr He Pro He Lys Glu Met Lys Val Glu Lys Asp Ala Ala Gly Thr 

35 40 45 

Val Thr Val Leu Glu Gly Arg Gin 
50 55 

(2) INFORMATION FOR 3EQ ID NO: 303: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:303: 



Met Asn Gin Glu Glu Leu Ala Lys Lys Met Leu Leu Pro Ser Lys Asn 

15 10 15 

Ser Arg Leu Glu Lys Leu Gly Lys Gly Leu Thr Phe Ala Cys Leu Ser 

20 25 30 

Leu lie Val He Leu Val Ala Met He Leu Val Phe Val Ala Gin Lys 

35 40 45 

Gly Leu Ser Thr Phe Phe Val Asn Gly Val Asn He Phe Asp Phe Leu 

50 55 60 

Leu Gly Gly Thr Trp Asn Pro Ser Ser Lys Glu Phe Gly Ala Leu Pro 
65 70 75 80 

Met He Leu Gly Ser Phe He Val Thr He Leu Ser Ala Leu lie Ala 

85 90 95 

Thr Pro Phe Ala He Gly Ala Ala Val Phe Met Thr Glu Val Ser Pro 

100 105 HO 

Lys Gly Ala Lys He Leu Gin Pro Ala He Glu Leu Leu Val Gly He 

115 120 125 

Pro Ser Val Val Tyr Gly Phe lie Gly Leu Gin Val Val Val Pro Phe 

130 135 140 

Val Arg Ser Val Phe Gly Gly Thr Gly Phe Gly He Leu Ser Gly He 
145 150 155 160 

Ser Val Leu Phe Val Met He Leu Pro Thr Val Thr Phe Met Thr Thr 

165 170 175 

Asp Ser Leu Arg Ala Val Pro Xaa Leu Leu Ser 
180 185 



(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

Met Thr Glu lie Arg Leu Glu His Val Ser Tyr Ala Tyr Gly Gin Glu 

1 5 10 15 

Arg He Leu Glu Asp He Asn Leu Gin Val Thr Ser Gly Glu Val Val 

20 25 30 

Ser He Leu Gly Pro Ser Gly Val Gly Lys Thr Thr Leu Phe Asn Leu 

35 40 45 

He Ala Gly He Leu Glu Val Gin Ser Gly Arg He Val Leu Asp Gly 

50 55 60 

Glu Glu Asn Pro Lys Gly His Val Ser Tyr Met Leu Gin Lys Asp Leu 
65 70 75 80 

Leu Leu Glu His Lys Thr Val Leu Gly Asn He He Leu Pro Leu Leu 

85 90 95 

He Gin Lys Val Asp Lys Ala Glu Ala He Ser Arg Ala Asp Lys He 

100 105 HO 

Leu Ala Thr Phe Gin Leu Thr Ala Val Arg Asp Lys Tyr Pro His Glu 

H5 120 125 

Leu Ser Gly Gly Met Arg Gin Arg Val Ala Leu Leu Arg Thr Tyr Leu 

130 135 140 

Phe Gly His Lys Leu Phe Leu Leu Asp Glu Ala Phe Ser Ala Leu Asp 
145 150 155 160 

Glu Met Thr Lys Met Glu Leu His Ala Trp Tyr Leu Glu He His Lys 

165 170 175 

Gin Leu Gin Leu Thr Thr Leu He Xaa Thr His Ser He Glu Glu Ala 

180 185 190 

Leu Xaa Leu Ser Asp Arg He Tyr He Leu Xaa Asn Xaa Pro Gly Gin 

195 200 205 

He Val Ser Glu He Lys Leu Asp Trp Ser Glu Asp Glu Asp Xaa Glu 

210 215 220 

Val Xaa Lys He Ala Xaa Lys Arg Gin He Leu Ala Glu Leu Gly Leu 
225 230 235 240 

Asp Lys 



(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 



Met Glu His Pro Xaa Glu Ala Ala 

1 5 
Glu Leu Lys Glu Lys Arg Asp Phe 
20 

Ser Lys Glu Tyr Ala Ser Asp Lys 

35 40 
Ala Arg Tip Asn Ala Phe Tyr Lys 

50 55 
Lys Glu Asp Leu Thr Asp Lys Gly 
65 70 



Asp lie Leu lie Lys Asn Ala Pro 

10 15 
Val lie Glu Ser Gin Lys Tyr Leu 
25 30 
Glu Lys Trp Gly Gin Phe Asp Ala 
45 

Trp Asp Lys Glu Asn Gly He Leu 
60 

Phe Thr Asn Glu Phe Val Lys 
75 



(2) INFORMATION FOR SEQ ID NO: 306: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 



Met Val Tyr Thr Ser Leu Ser Ser Lys Asp Gly Asn Tyr Pro Tyr Gin 

15 10 15 

Leu Asn He Ala His Leu Tyr Gly Asn Leu Met Asn Thr Tyr Gly Asp 

20 25 30 

Asn Gly Asn He Leu Met Leu Lys Tyr Val Ala Glu Lys Leu Gly Thr 

35 40 45 

His Val Thr Val Asp He Val Ser Leu His Asp Asp Phe Asp Glu Asn 

50 55 60 

His Tyr Asp He Ala Phe Phe Gly Gly Gly Gin Asp Phe Glu Gin Ser 
65 70 75 80 

He He Ala Asp Asp Leu Pro Ala Lys Lys Glu Ser He Asp Asn Tyr 

85 90 95 

He Gin Asn Asp Gly Val Val Leu Ala lie Cys Gly Gly Phe Gin Leu 

100 1C5 HO 

Leu Gly Gin Tyr Tyr Val Glu Ala Ser Gly Lys Arg He Glu Gly Leu 
115 120 125 
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Gly Val Met Gly His Tyr Thr Leu Asn Gin Thr Asn Asn Arg Phe lie 

130 135 140 

Gly Asp lie Lys He His Asn Glu Arg Phe Arg 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 307: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 



Met His Gin His Ser Pro His Thr 

1 5 
Ala Met Ser Lys Tyr Pro Glu Arg 
20 

Asn Glu Glu Ser Asp Phe Glu Leu 

35 40 
He Leu Asn Phe Lys Phe Val Gin 
50 55 



Ala Ser Tyr Thr Ser Ser Arg Asp 

10 15 
Cys Thr Thr Val Gly Leu Arg Leu 
25 30 
Tyr Ala Pro Tyr Gly Leu Glu Asp 
45 

Leu Leu He Ser 
60 



(2) INFORMATION FOR SEQ ID NO: 308: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 308: 



Met Phe Glu He Phe Asn Met Gly 

1 5 
Pro Glu Asn Val Glu Arg Val Lys 
20 

Glu He Gly Arg He Val Lys Lys 
35 40 



Val Gly Leu Met Leu Ala Val Ser 

10 15 
Glu Leu Leu Asp Glu Ala Val Tyr 
25 30 
Glu Asn Glu Ser Val He He Lys 
45 
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(2) INFORMATION FOR SEQ ID NO: 309: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 

Met Met Val Lys Lys His Tyr Phe Lys Asp Asn Glu Glu Asp Ser Lys 

15 10 15 

Met Arg Arg He Leu Leu Leu Val Ala Val Leu Leu Met He Pro Ser 

20 25 30 

Phe He Ser Ala Thr Thr Leu Val Arg Glu Thr Leu Lys Lys Glu Ser 

35 40 45 

Leu Lys Lys Phe He Ser Glu Gin Phe Gin Gly His Asn He Leu Lys 

50 55 60 

Lys Thr Tyr Ser Lys Lys Thr His Thr Leu Lys Leu Thr He Ser Gly 
65 70 75 80 

Asn Tyr Leu Thr Glu Glu Glu Leu Asp Met He Ser Ser Lys Arg Gly 

85 90 95 

Asp Tyr Gly Leu Ser Asp Val Ser Val Gin Val Ser Gin Leu Ser Asp 

100 105 HO 

Ser Glu Gin Leu Ser Lys Glu Glu Leu Val Glu Tyr Phe Phe Gin Tyr 

115 120 125 

He Lys Asp Lys Glu Ala Lys G*lu Lys Glu Lys Ala Asn Lys Phe Tyr 

130 135 140 

Thr Glu Ser Glu Glu Gin 
145 150 

(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 



Met Ser He Glu Pro Arg Arg Arg Trp Arg Asn Ser Thr Val Cys Leu 

15 10 15 

Glu Ala Glu Leu Tyr Gin Glu Ser Leu Val Leu He Gly Gly Asp Pro 

20 25 30 

Gly He Gly Lys Ser Thr Leu Leu Leu Gin Val Ser Thr Gin Leu Ser 

35 40 45 

Gin Val Gly Thr Val Leu Tyr Val Ser Gly Glu Glu Ser Ala Gin Gin 

50 55 60 

He Lys Leu Arg Ala Glu Arg Leu Gly Asp Phe Asp Ser Glu Phe Tyr 
65 70 75 80 

Leu Tyr Ala Glu Thr Asn Met Gin Ser Val Arg Ala Glu Val Val Ala 

85 90 95 

Tyr Pro Ala Arg Leu Ser His Tyr 
100 



(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 

Met Ala Thr Lys Gin Lys Glu Val Thr Thr Phe Asp Val Gin Val Ala 

15 10 15 

Glu Phe lie Arg Asn His Lys Gin Lys Gly Thr Ala Thr Asp Asp Glu 

20 25 30 

He Asn Ala Ser Leu Val He Pro Phe Thr Leu Asp Ala Asp Gly He 

35 40 45 

Glu Asp Leu Leu Gin Arg He Gin Asp Ala Gly He Ser He Thr Asp 

50 55 60 

Asn Glu Gly Asn Pro Ser Ala Arg Val Leu Ser Asn Glu Glu Glu Pro 
65 70 75 80 

Glu Leu Ser Asp Glu Asp Leu He Gly Ser Thr Ser Ala Lys Val Asn 

85 90 95 

Asp Pro Val Arg Met Tyr Leu Lys Glu He Gly Val Val Pro Leu Leu 
100 105 HO 
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Thr Asn Glu Glu Glu Lys Glu Leu Ala Leu Ala Val Glu Ala Gly Asp 

115 120 125 

lie Glu Ala Lys Gin Arg Leu Ala Glu Ala Asn Leu Arg Leu Val Val 

130 135 140 

Ser He Ala Lys Arg Tyr Val Gly Arg Gly Xaa Gin Ser Leu Thr 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 104 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 



Met Met Leu Lys Pro Ser He Asp 

1 5 
Lys Tyr Set Leu Val He Leu Glu 
20 

Ala Gly Ala Pro Ala Thr Gin Gly 

35 40 
Arg Ala Leu Glu Glu He Glu Ser 

50 55 
Pro Glu Gly Lys Arg Glu Ala Val 
65 70 
Arg Arg Lys Glu Glu Glu Glu Lys 
85 

Glu Lys Glu Asp Gly Glu Lys He 
100 



Thr Leu Leu Asp Lys Val Pro Ser 

10 15 
Ala Lys Arg Ala His Glu Leu Glu 
25 30 
Phe Lys Ser Glu Lys Ser Thr Leu 
45 

Gly Asn Val Thr He His Pro Asp 
60 

Arg Arg Arg He Glu Glu Glu Lys 

75 80 
Lys He Lys Glu Gin He Ala Lys 
90 95 



(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 144 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 



Met lie Arg Gin Gly Gin Met Leu Glu Tyr Ala Glu 

15 10 
Tyr Tyr Gly Thr Pro Leu Thr Tyr Val Asn Glu Thr 

20 25 
lie Asp Val Phe Leu Glu lie Glu Val Gin Gly Ala 

35 40 
Lys Lys Val Pro Asp Ala Val Phe lie Phe Leu Thr 

50 55 60 

Asp Glu Leu Gin Glu Arg Leu Val Gly Arg Gly Thr 
65 70 75 

Val He Ala Gin Arg He Glu Lys Ala Lys Glu Glu 

85 90 
Arg Glu Tyr Asp Tyr Ala He Val Asn Asp Gin Val 

100 105 
Glu Arg Val Lys Cys Val He Glu Ala Glu His Phe 

115 120 
Val He Gly His Tyr Gin Glu Met Leu Pro Lys Ser 
130 135 140 



Tyr Val Gly Asn 
15 

Leu Asp Lys Gly 
30 

Leu Gin Val Lys 
45 

Pro Pro Asp Leu 

Asp Ser Ala Glu 
80 

He Ala Leu Met 
95 

Pro Leu Ala Ala 
110 

Cys Val Asp Arg 
125 

Pro Thr Thr Arg 



(2) INFORMATION FOR SEQ ID NO: 314: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: 

Met Ser Arg Leu Leu Val He Gly 

1 5 
He Ser Lys He Cys Gin Asp Ser 
20 

Ala Ser Arg Thr Lys Ser Lys Cys 

35 40 
Gly Lys Thr Ser Thr Lys lie Glu 

50 55 
Val Glu Glu Val He Ala Leu He 
65 70 
Phe 



SEQ ID NO: 314: 

Cys Gly Gly Val Ala Gin Val Ala 

10 15 
Glu Thr Phe Thr Glu He Met He 
25 30 
Asp Asp Leu Lys Ala Lys Leu Glu 
45 

Thr Ala Ala Leu Asp Ala Asp Lys 
60 

Glu Ser Leu Gin Thr Ser Lys Leu 
75 80 
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(2) INFORMATION FOR SEQ ID NO:315: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 315: 



Met His Glu Ser Phe His Val Asp Asp Pro Thr Leu Tyr Ser Arg Glu 

15 10 15 

Trp Phe Ser Trp Ala Asn Met Met Phe Cys Glu Leu Val Leu Asp Tyr 

20 25 30 

Leu Asp lie Arg 
35 



(2) INFORMATION FOR SEQ ID NO: 316: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 



Met Arg Phe Tyr Phe Met Glu Asn 

1 5 
Ser His Trp Asp Arg Glu Trp Tyr 
20 

Gin Leu Val Glu Leu Phe Asp Asn 

35 40 
Pro Glu Phe Lys Ser Phe His Leu 
50 55 

Asp 
65 



Val Val Val His lie He Ser His 

10 15 
Leu Pro Phe Glu Ser His Arg Met 
25 30 
Leu Phe Asp Leu Phe Glu Asn Asp 
45 

Asp Gly Gin Thr He Val Leu Asp 
60 
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<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 253 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

Met Leu Leu Tyr He Leu Ser His He Ala Thr Gly He Val Ser Gly 

15 10 15 

Thr Cys Leu Leu Lys Ala He Val Cys Asn Trp Tip Asn Cys Leu Thr 

20 25 30 

He Ser Leu He Ser Leu Lys Met Thr Leu Ser Ser Arg Val Ser Thr 

35 40 45 

Trp Met Asp Lys Leu Leu Ser Leu Thr Thr Asn Leu Gin He Arg Pro 

50 55 60 

Glu Asn Arg Asp Lys Val Gin Arg Tyr He Asp Glu Gly Lys Leu Lys 
65 70 75 80 

He Gly Pro Phe Tyr He Leu Gin Asp Asp Tyr Leu He Ser Ser Glu 

85 90 95 

Ala Asn Val Arg Asn Thr Leu He Gly Gin Gin Glu Ala Ala Lys Trp 

100 105 110 

Gly Lys Ser Thr Gin He Gly Tyr Phe Pro Asp Thr Phe Gly Asn Met 

115 120 125 

Gly Gin Ala Pro Gin He Leu Gin Lys Ser Gly lie His Val Ala Ala 

130 135 140 

Phe Gly Arg Gly Val Lys Pro He Gly Phe Asp Asn Gin Val Leu Glu 
145 150 155 160 

Asp Glu Gin Phe Thr Ser Gin Phe Ser Glu Met Tyr Trp Gin Gly Val 

165 170 175 

Asp Gly Ser Arg Val Leu Gly He Leu Phe Ala Asn Trp Tyr Ser Asn 

180 185 190 

Gly Asn Glu He Pro Val Asp Lys Asp Glu Ala Leu Thr Phe Trp Lys 

195 200 205 

Gin Lys Leu Ser Asp Val Arg Cys Leu Arg Phe Asp Gin Pro Met Val 

210 215 220 

Asp Asp Glu Thr Ala Val Asn Thr Ser Leu Ser Gin Lys Lys Ser Glu 
225 230 235 240 

Arg Ser His Ser Val Trp Gin Met Asn Ser Ser Arg Met 
245 250 
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(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

Met Thr lie Val Gly Cys Arg He Asp Gly Arg Leu He His Gly Gin 

15 10 15 

Val Ala Asn Leu Trp Ala Gly Lys Leu Asn Val Ser Arg He Met Val 

20 25 30 

Val Asp Asp Glu Val Val Asn Asn Asp He Glu Lys Ser Gly Leu Lys 

35 40 45 

Leu Ala Thr Pro Pro Gly Val Lys Leu Ser He Leu Pro Val Glu Lys 

50 55 60 

Ala Ala Ala Asn He Leu Ala Gly Lys Tyr Asp Ser Gin Arg Leu Phe 
65 70 75 80 

He Val Ala Arg Lys Pro Asp Arg Phe Pro Trp Phe Gly Arg Ser Arg 
85 90 95 

Cys Thr Thr 



(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

Met He Gin His Pro Arg He Gly He Arg Pro Thr He Asp Gly Arg 

15 10 15 

Arg Gin Gly Val Arg Glu Ser Leu Glu Val Gin Thr Met Asn Met Ala 
20 25 30 
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Lys Ser Val Ala Asp Leu lie Ser Ser Thr Leu Lys Tyr Pro Asp Gly 

35 40 45 

Glu Pro Val Glu Cys Val lie Ser Pro Ser Thr lie Gly Arg Val Pro 

50 55 60 

Glu Ala Ala Ala Ser His Glu Leu Phe Lys Lys Ser Asn Val Cys Ala 
65 70 75 80 

Thr He Thr Val Thr Pro Cys Trp Cys Tyr Gly Ser Glu Thr Met Asp 

85 90 95 

Met Ser Pro Asp He Pro His Ala He Trp Gly Phe Asn Gly Thr Glu 

100 105 110 

Arg Pro Gly Ala Val Tyr Leu Ala Ala Val Leu Ala Ser His Ala Gin 

115 120 125 

Lys Gly He Pro Ala Phe Gly He Tyr Gly Arg Asp Val Gin Glu Ala 

130 135 140 

Asn Asp Thr Asp He Pro Glu Asp Val Lys Glu Asn Phe 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 320: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 



Met Leu Leu Ser Thr Lys Gly He 

1 5 
Thr Leu Glu Arg Leu Phe Ser Met 
20 

Arg His Pro Gly Tyr Arg Lys Tyr 

35 40 
Thr Glu Lys 
50 



Glu Glu Gin Glu He Ala Arg Pro 

10 15 
Arg Glu Asn Tyr Lys Val Thr Gly 
25 30 
Asn Gly Asp Gly Ser Met Lys Glu 
45 



(2) INFORMATION FOR SEQ ID NO: 321: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

JC) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:321: 

Met Leu Thr Leu Thr Pro Asp Val lie Tyr Gin Glu Pro Asp lie Leu 

15 10 15 

Tyr Gin His Glu Asp Phe Ser Leu Val Lys lie Arg Gin lie Arg Phe 
20 25 30 

Cys 



(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:322: 

Met Ser Ser Val Val Gly Leu Met Gly Asn He Gly Gin Ala Asn Tyr 

15 10 15 

Ala Ala Ser Lys Ala Gly Leu He Gly Phe Thr Lys Ser Val Ala Arg 

20 25 30 

Glu Val Ala Ser Arg Asn He Arg Val Asn Val He Ala Pro Gly Asn 
35 40 45 

Asp 



(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 127 amino acids 
<B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 



Met Lys Leu Glu His Lys Asn lie Phe lie Thr Gly Ser Ser Arg Gly 

15 10 15 

lie Gly Leu Ala lie Ala His Lys Phe Ala Gin Ala Gly Ala Asn lie 

20 25 30 

Val Leu Asn Ser Arg Gly Ala lie Ser Glu Glu Leu Leu Ala Glu Phe 

35 40 45 

Ser Asn Tyr Gly lie Lys Val Val Pro lie Ser Gly Asp Val Ser Asp 

50 55 60 

Phe Ala Asp Ala Lys Arg Met lie Asp Gin Ala lie Ala Glu Leu Gly 
65 70 75 80 

Ser Val Asp Val Leu Val Asn Asn Ala Gly lie Thr Gin Asp Thr Leu 

85 90 95 

Met Leu Lys Met Thr Glu Ala Asp Phe Glu Lys Val Leu Lys Val Asn 

100 105 110 

Leu Thr Gly Ala Phe Asn Met Thr Gin Ser Val Phe Gly Asn Arg 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 324: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

( B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:324: 



Met Glu Ser Met Pro Arg lie Gly 

1 5 
Asp Thr Glu Thr lie Phe Asn Ala 
20 

Lys Ala Ser Ala lie Ala Tyr lie 

35 40 
Glu Asn Ala Gly Glu Met Gly Arg 

50 55 
Ser Val 
65 



Leu Leu Val Thr Val Glu Asn Arg 

10 15 
Ala Gly Leu Asp Phe Asp Val Leu 
25 30 
Asn Ala Asn Thr Phe Val Gin Lys 
45 

Ser Val Ser Tyr His Asp Met Arg 
60 



(2) INFORMATION FOR SEQ ID NO: 325: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

Met Arg Gin Gly Gin Gly Ser Val Glu Ala lie Phe Asn Ala lie Asp 

15 10 is 

Lys Phe Phe Asn Gin Ser Val Arg Leu Val Ser Tyr Thr He Asn Ala 

20 25 30 

Val Thr Asp Gly He Asp Ala Gin Asp Arg Val Val Gly His Cys 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

Met Val Glu Asn Pro Glu Gly Phe His Phe Asp Asp Leu Gin Leu Gin 

1 5 10 15 

Thr His Ala Asp Asn Asp He Glu Ala Leu Val Ser Leu- Ala Asn Met 

20 25 30 

Asp Gly Glu Lys Val Glu Phe Asn Ala Thr Gly Thr Gly Phe Arg 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 210 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 



Met Asn Phe Gin Leu Ala Lys Tyr Ser Leu Leu Lys Lys Phe Ser Glu 

15 10 15 

Asn lie Gly Phe Thr Thr Pro Glu Glu Cys Gly Ala lie Phe Gin Tyr 

20 25 30 

Leu lie Glu Asn Val Gin Thr Asp Arg Gin He He Tyr Ser Pro Pro 

35 40 45 

Cys His Asp Glu Leu Arg Met Ala Val Ala Asn Ser Leu Ala Ala Val 

50 55 60 

Lys Asn Gly Ala Gly Leu Phe Glu Glu Thr He His Gly He Arg Glu 
65 70 75 80 

Arg Ala Glu Asn Ala Ala Leu Glu Glu He Ala Val Ala Leu Asn He 

85 90 95 

Arg Gin Asp Tyr Tyr Gin Val Glu Thr Ser He Val Leu Asn Glu Thr 

100 105 110 

lie Asn Thr Ser Glu Met Val Ser Arg Phe Ser Gly He Pro Val Pro 

115 120 125 

Lys Asn Lys Ala Val Val Gly Gly Asn Thr Phe Ser His Glu Ser Gly 

130 135 140 

lie His Gin Asp Gly Val Leu Lys Asn Pro Leu Thr Tyr Glu He He 
145 150 155 160 

Thr Pro Glu Leu Val Gly Val Lys lie Pro Leu Gly Lys Leu Ser Gly 

165 170 175 

Arg His Ala Phe Val Glu Lys Leu Arg Glu Leu Ala Leu Asp Phe Thr 

180 185 190 

Glu Glu Asp lie Lys Pro Phe Phe Ala Lys Phe Lys Ala Leu Ala Asp 
195 200 205 

Lys Lys 
210 



(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 
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Met Ser Gin Gin Val Lys Asn Ala His Asn Leu Tyr He His Ala He 

15 10 is 

Gin Asp Gly Arg Val Ala Glu Ala Gin Ala Gin Ser Val Gly Asp Thr 

20 25 30 

Tyr He Gin His Ser Thr Gly Val Pro Asp Gly Lys Glu Gly Phe Ala 

35 40 45 

Ala Phe Phe Ala Asp Phe Phe Glu Arg His Pro Glu Arg Gin He Lys 

50 55 60 

He Val Arg Thr He Glu Asp Gly Asn Leu Val Phe Val His Val Pro 
65 70 75 80 

Ser lie Ser Glu Trp Trp Arg Ser Ser Met Gly Asp Asp Gly Tyr Phe 
85 90 95 

Pro Cys Gly 



(2) INFORMATION FOR SEQ ID NO:329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

(C> STRANDEDNESS : single 
<D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

Met Glu Met Gin Thr Ser Ala Gin Leu Leu Thr Asn Lys He Phe Leu 

1 5 10 15 

Lys Asn Pro Leu Lys Ala Leu Val Glu Glu Lys Tyr Gly He Glu Tyr 

20 25 30 

Glu Glu Phe Thr Asn Pro Trp His Ala Ala He Ser Ser Phe Val Ala 

35 40 45 

Phe Phe Leu Arg Ser Leu Pro Pro Met Leu Ser Val Thr He Phe Pro 

50 55 60 

Ser Glu Tyr Arg He Pro Ala Thr Val Leu He Val Gly Val Ala Leu 
65 70 75 80 

Leu Leu Thr Gly Tyr Thr Ser Ala Arg Leu Gly Lys Asp Pro Thr Arg 

85 90 95 

Thr Ala Met He Arg Asn Leu Ala He Gly Leu Leu Thr Met Gly Val 

100 105 HO 

Thr Phe Leu Leu Glu Gin Leu Phe Ser He 
115 120 
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(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

Met Ala Phe Phe Leu Cys Pro His Phe Arg Ser Asn His Trp Lys lie 

15 10 15 

Leu Thr Val Ser Asp Thr Met Glu Glu Lys Arg Leu Glu Tyr Pro Met 

20 25 30 

Val Thr Phe Leu Gly Asn Pro Val Ser Phe Thr Gly Lys Gin Leu Gin 

35 40 45 

Val Gly Asp Lys Ala Leu Asp Phe Ser Leu Thr Thr Thr Asp Leu Ser 

50 55 60 

Lys Lys Ser Leu Ala Asp Phe Asp Gly Lys Lys Lys Val Leu Ser Val 
65 70 75 80 

Val Pro Ser lie Asp Thr Gly lie Cys Ser Thr Gin Thr Arg Arg Phe 

85 90 95 

Asn Glu Glu Leu Ala Gly Leu Asp Asn Thr Val Val Leu Thr Val Ser 

100 105 110 

Met Asp Leu Pro Phe Ala Gin Lys Arg Trp Cys Gly Ala Glu Gly Leu 

115 120 125 

Asp Asn Ala He Met Leu Ser Asp Tyr Phe Asp His Ser Phe Gly Arg 

130 135 140 

Asp Tyr Ala Leu Leu He Asn Glu Trp His Leu Leu Ala Arg Ala Val 
145 150 155 160 

Phe Val Leu Asp Thr Asp Asn Thr He Arg Tyr Val Glu Tyr Val Asp 

165 170 175 

Asn He Asn Ser Glu Pro Asn Phe Glu Ala Ala He Ala Ala Ala Lys 
180 185 190 

Ala Leu 



(2) INFORMATION FOR SEQ ID NO:331: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 48 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 



Met Gly Tyr Arg Pro Ser Thr Ala Asn Ala He He His Gin Val Arg 

15 10 15 

Glu Leu Leu Val Ser Arg Gly Tyr Thr Phe Tyr Asn Arg Lys Arg Leu 

20 25 30 

Met Val Val Pro Lys Ser Val Val Lys Glu Leu Leu Gly Met Glu Leu 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



fii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332 : 



Met Gin Ala Val Glu His Phe He Lys Gin Phe Val Pro Glu His Tyr 

15 10 15 

Asp Leu Phe Leu Asp Leu Ser Arg Glu Thr Lys Thr Phe Ser Gly Lys 

20 25 30 

Val Thr He Thr Gly Gin Ala Gin Ser Asp Arg He Ser Leu His Gin 

35 40 45 

Lys Asp Leu Glu He Thr Ser Val Glu Val Ala Gly Gin Ala Arg Pro 

50 55 60 

Phe Thr Val Asp His Asp Asn Glu Ala Leu His He Glu Leu Ala Glu 
65 * 7 0 75 80 

Ala Gly Gin Val Glu Leu Val Leu Ala Phe Ser Gly Lys He Thr Asp 

65 90 95 

Asn Met Thr Gly He Tyr Pro Ser Tyr Tyr Thr Val Asp Gly Val Lys 

100 105 no 

Lys Glu Val Leu Ser Thr Gin Phe Glu Ser His Phe Ala Arg Glu Ala 
115 120 125 
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Phe Pro Cys Val Asp Glu Pro Glu Ala Lys Ala 

130 135 
Leu Arg Phe Asp Gin Ala Glu Gly Glu Leu Ala 
145 150 155 

Glu lie Asp Val Glu Asn Arg Lys Glu Thr Gly 

165 170 
Thr Thr Pro Arg Met Ser Ser Tyr Leu Leu Ala 

180 185 
Leu Gin Gly Val Thr Ala Lys Thr Lys Asn Gly 

195 200 
Leu Leu Asn Gin Ser Thr Ser Thr Phe Lys Ser 
210 215 

(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

Met Ser Gin Glu Phe Leu Ala Arg lie Leu Glu Gin Lys Ala Arg Glu 

15 10 15 

Val Glu Gin Met Lys Leu Glu Gin lie Gin Pro Leu Arg Gin Thr Tyr 

20 25 30 

Arg Leu Ala Glu Phe Leu Lys Asn His Gin Asp Arg Leu Gin Val lie 

35 40 45 

Ala Glu Val Lys Lys Ala Ser Pro Ser Phe Gly Glu He Ser He Ser 

50 55 60 

Met Trp He Leu Cys Asn Arg Pro Arg Leu Met Lys Lys Thr Glu Gin 
65 70 75 80 

(2) INFORMATION FOR SEQ ID NO: 334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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Thr Phe Asp Leu Ser 
140 

Leu Ser Asn Met Pro 
160 

He Trp Lys Phe Glu 
175 

Phe Val Ala Gly Asp 
190 

Thr Leu Val Gly Cys 
205 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

Met Ser Ser Phe Thr Pro Glu Asp Leu Gly Met Glu Gly Tyr Ala Met 

15 10 15 

Glu Asp lie Arg Gly Gly Asn Ala Gin Glu Asn Ala Glu lie Leu Leu 

20 25 30 

Ser Val Leu Lys Asn Glu Ala Ser Pro Phe Leu Glu Thr Thr Val Leu 

35 40 45 

Asn Ala Gly Leu Gly Phe Tyr Ala Asn Gly Lys lie Asp Ser lie Lys 

50 55 60 

Glu Gly Val Ala Leu Ala Arg Gin Val lie Ala Arg Gly Lys Ala Leu 
65 70 75 80 

Glu Lys Leu Arg Leu Leu Gin Glu Tyr Gin Lys 



(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:335: 

Met lie Tyr Thr Val Thr Leu Asn Pro Ser lie Asp Tyr lie Val Arg 

15 10 15 

Leu Asp Gin Val Lys Val Gly Ser Val Asn Arg Met Asp Ser Asp Asp 

20 25 30 

Lys Phe Ala Gly Gly Lys Gly lie Asn Val Ser Arg Val Leu Lys Arg 

35 40 45 

Leu Asn lie Ser Asn Thr Ala Thr Gly Phe He Gly Gly Phe Thr Gly 

50 55 60 

Lys Phe Xaa Xaa Asp Thr Leu Ala Glu Glu Glu He Glu Xaa Arg Phe 
65 70 75 80 

Val Gin Val Ala Glu Asp Thr Arg He Asn Val Lys He Xaa Ala Asp 

85 90 95 

Gin Glu Thr Glu He Asn Gly Thr Gly Pro Thr Val Glu Pro Val Lys 



85 



90 



100 



105 



110 
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Leu Glu Glu Leu Lys Ala lie Leu Ser Ser Leu Thr Ala Glu Asp Thr 

115 120 125 

Val Val Phe Ala Gly Ser Ser Ala Lys Asn Leu Gly Asn Val lie Tyr 

130 135 140 

Lys Gly Phe Asn Leu Leu Asp Ala Pro Asp Trp Cys Ala Ser Gly Leu 
145 150 155 160 



(2) INFORMATION FOR SEQ ID NO: 336: 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 133 amino acids 
(B> TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 

Met Met Gin Val Tyr Phe Asp Gin Gly lie Tyr Asn Lys Lys Ala Val 

15 10 15 

Phe Glu Val Tyr Phe Arg Gin Gin Pro Phe Lys Asn Gly Tyr Ala Val 

20 25 30 

Phe Ala Gly Leu Glu Arg lie Val Asn Tyr Leu Glu Asp Leu Arg Phe 

35 40 45 

Ser Asp Ser Asp He Ala Tyr Leu Glu Ser Leu Gly Tyr His Gly Ala 

50 55 60 

Phe Leu Asp Tyr Leu Arg Asn Phe Lys Leu Glu Leu Thr Val Arg Ser 
65 70 75 80 

Ala Gin Glu Gly Asp Leu Val Phe Ala Asn Glu Pro He Val Gin Val 

85 90 95 

Glu Gly Pro Leu Ala Gin Cys Gin Leu Val Glu Thr Ala Leu Leu Asn 

100 105 HO 

He Val Asn Tyr Gin Thr Leu Val Ala Thr Lys Ala Ala Pro Tyr Pro 

115 120 125 

Phe Gly Tyr Arg Lys 
130 



(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE 



TYPE: protein 



(xi> SEQUENCE 



DESCRIPTION: SEQ ID NO: 3 37: 



Met Lys lie lie lie Gin Arg Val Lys Lys Ala Gin Val Ser lie Glu 

15 10 15 

Gly Gin lie Gin Gly Lys lie Asn Gin Gly Leu Leu Leu Leu Val Gly 

20 25 30 

Val Gly Pro Glu Asp Gin Glu Glu Asp Leu Asp Tyr Ala Val Arg Lys 

35 40 45 

Leu Val Asn Met Arg lie Phe Ser Asp Val Glu Gly Lys Met Asn Leu 

50 55 60 

Ser Val Lys Asp lie Glu Gly Glu lie Leu Ser lie Leu Ser Leu Pro 
65 70 75 80 

Leu Cys Gly Tyr 



(2) INFORMATION FOR SEQ ID NO: 338: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 

Met He Lys Gly Thr He Lys Thr Val Ser He Met Ala Ala Thr Ser 

15 10 15 

Gly Phe Leu Leu Tyr Asn Glu Val Phe Phe Leu Thr Asn Gly Ala Ala 

20 25 30 

Gly Thr Lys Ser He Ser Phe Val He Arg Glu Leu Ala Val Ala Ser 

35 40 45 

Ser Arg Thr Gin Tyr Ala Arg Ala Asn Thr He Gly Val He Gin He 

50 55 60 

Leu Gly Gly Met Leu He He Val Cys He Asn He Leu Phe Arg Glu 
65 70 75 80 

Arg Lys Arg Leu Lys Gly Gly Lys 



85 
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(2) INFORMATION FOR SEQ ID NO: 339: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 



Met Gly 
1 

Asp Lys 

Asn Gin 

Val Met 

50 
Ser Phe 
65 

Asp Phe 

Asp Asn 

Val Pro 

Pro Met 
130 
Phe His 
145 



Phe Gly He 
5 

Glu Val Val 
20 

Glu Leu Ala 
35 

Leu Asn Asn 

Tyr Glu Gly 

Gin Leu Met 
85 

Pro Glu Thr 

100 
Met Leu He 
115 

Val Pro Ala 
Ala 



Pro Ala 

Leu Phe 

He Leu 

His Ser 

55 
Arg Thr 
70 

Ala Gin 

Leu Ala 

Glu Val 

Gly Lys 
135 



Ala He Gly Ala 
10 

Val Gly Asp Gly 
25 

Asn He Tyr Lys 
40 

Leu Gly Met Val 

Ser Glu Ser Val 
75 

Ala Tyr Gly He 
90 

Gin Asp Leu Glu 
105 

Asp He Ser Arg 
120 

Ser Asn His Glu 



Lys He 

Gly Phe 

Val Pro 

45 
Arg Gin 
60 

Phe Asp 

Lys Asn 

Ala Thr 

Lys Glu 
125 
Met Leu 
140 



Ala Asn Pro 
15 

Gin Met Thr 
30 

He Lys Val 

Trp Gin Glu 

Thr Leu Pro 
80 

Tyr Lys Phe 
95 

Thr Glu Asp 
110 

Gin Val Leu 
Gly Val Lys 



(2) INFORMATION FOR SEQ ID NO: 340: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 
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Met Trp Leu Pro Leu Asp Arg Asn Asn Lys Gly Gin Asn lie Ala Gly 

1 5 10 15 

Ala Arg Gin Ala Ala Glu Gly He Phe Gly Val Asp Ala Ser Gin Leu 

20 25 30 

Thr Val Pro Gin Ala Ala Phe Leu Ala Gly Leu Pro Gin Ser Pro He 

35 40 45 

Thr Tyr Ser Pro Tyr Glu Asn Thr Gly Glu Leu Lys Asn 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 341: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 



Met Tyr Leu Gly Asp Leu Met Glu 

1 5 
He Leu Ser Phe Leu Leu Gin Glu 
20 

Met Glu Glu Thr Gly Phe Ser Lys 

35 40 
Leu Leu Asn Asp Lys Ala Leu Asp 

50 55 
Ser Glu Asp Glu Asn Leu Arg Leu 
65 70 
Arg Asp He Pro Glu Leu Val Phe 
85 

Phe Trp Phe He Phe Ser Thr Thr 
100 



Lys Ala Glu Cys Gly Gin Phe Ser 

10 15 
Ser Gin Thr Thr Val Lys Ala Val 
25 30 
Ala Thr Leu Thr Lys Tyr Val Thr 
45 

Ser Gly Leu Glu Leu Thr He His 
60 

Ser He Gly Ala Ala Thr Lys Gly 

75 80 
Trp He Val Leu Leu Asn Thr Arg 

90 95 
Asn Ser Phe 
105 



(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:342: 

Met Lys Glu lie He Glu Lys Leu Ala Lys Phe Glu Asn Leu Ser Gly 

15 10 15 

Val Glu Met Thr Asp Val He Glu Arg lie Val Thr Gly Arg Val Thr 

20 25 30 

Glu Ala Gin He Ala Ser Leu Leu Leu Ala Leu Lys Met Lys Gly Glu 

35 40 45 

Thr Pro Glu Glu Arg Thr Ala He Ala Gin Val Met Arg Gly His Ala 

50 55 60 

Gin His He Pro Thr Glu He His Asp Ala Met Asp Asn Cys Gly Thr 
65 70 75 80 

Gly Gly Asp Lys Ser Phe Ser Phe Asn He Ser Thr Thr Ala Ala Phe 

85 90 95 

Val Leu Ala Gly Gly Gly He His Met Ala Lys His Gly Asn Arg Ser 

100 105 110 

He Ser Ser Lys Ser Gly Ser Ala Asp Ser Xaa Asn Leu Gly Asn Gin 
115 120 125 

Ser 



(2) INFORMATION FOR SEQ ID NO: 3 43: 

(i) SEQUENCE CHARACTERISTICS : 
{A) LENGTH: 13 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 343: 

Met Asn Val Gin Met Ser Ser Lys Thr Asn He Leu Arg Ala His Ala 

15 10 15 

Glu Met Gin Asn He Gin Arg Arg Ala Asn Glu Glu Arg Gin Asn Leu 

20 25 30 

Gin Arg Tyr Arg Ser Gin Asp Leu Ala Lys Ala He Leu Pro Ser Leu 

35 40 45 

Asp Asn Leu Glu Arg Ala Leu Ala Val Glu Gly Leu Thr Asp Asp Val 
5C 55 60 

266 



WO 97/37026 PCT/US97/O5306 

Lys Lys Gly Leu Gly Met Val Gin Glu Ser Leu He His Ala Leu Lys 
65 70 75 80 

Glu Glu Gly He Glu Glu He Ala Ala Asp Gly Glu Phe Asp His Asn 

85 90 95 

Tyr His Met Ala lie Gin Thr Leu Pro Ala Asp Asp Asp His Pro Val 

100 105 no 

Asp Thr He Ala Gin Val Phe Gin Lys Gly Tyr Lys Leu His Asp Arg 

115 120 125 

He Leu Arg Pro Ala Met Val Val Val Tyr Asn 
130 135 

(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 344: 

Met Ser Asn Lys Ser Tyr Ser Lys Phe Gin Glu Glu Val Ser Leu Lys 

1 5 io 15 

Tyr Gly Phe He Gly Leu Lys Leu Asp Lys Leu Ser Leu Thr Ala Glu 

20 25 30 

Val Ser Glu Glu Phe His Ser Glu He Leu Ser Gly Asn Phe Thr Leu 

35 40 45 

Tyr Asp He Ser Val 
50 

(2) INFORMATION FOR SEQ ID NO: 345 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: S2Q ID NO:345: 
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Met His Phe Asp Lys Ser Lys Phe Gly Ala Val Phe Ser Ala Pro Gly 

15 10 15 

Leu Tyr Glu Val Glu Val lie Asn Asn Ala Ser Phe Gly Gin Asn Ala 

20 25 30 

Gin Tyr Glu Val lie Gin Ser Arg Lys Leu Gly Thr Phe Ala Glu Leu 

35 40 45 

He Glu Met Ala Lys He Lys 

50 55 

(2) INFORMATION FOR SEQ ID NO: 346: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

Met He Gin Ala Val Phe Glu Arg Ala Glu Asp Gly Glu Leu Arg Ser 

15 10 15 

Ala Glu He Thr Gly His Ala Glu Ser Gly Glu Tyr Gly Leu Asp Val 

20 25 30 

Val Cys Ala Ser Val Ser Thr Leu Ala He Asn Phe He Asn Ser He 

35 40 45 

Glu Lys Phe Ala Gly Tyr Glu Pro He Leu Glu Leu Asn Glu Asp Glu 

50 55 60 

Gly Gly Tyr Leu Met Val Glu lie Pro Lys Asp Leu Pro Ser His Gin 
65 70 75 80 

Arg Glu Met Thr Gin Leu Phe Phe Glu Ser Phe Phe Leu Gly Met Ala 

85 90 95 

Asn Leu Ser Glu Asn Ser Ser Glu Phe Val Gin Thr Arg Val He Thr 
100 105 110 

Glu Asn 



(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 



Met Thr Ala He Ser Met Lys Trp Leu Asn Asp Tyr He Trp Pro Ala 

IS 10 15 

Glu Ser Glu Phe Thr Pro Asp Met Thr Thr Asn Ala Val Lys Glu Ala 

20 25 30 

Leu Thr Glu Met Leu Gin Ser Gly Thr Thr Thr Phe Asn Asp Met Tyr 

35 40 45 

Asn Pro Asn Gly Val Asp He Gin Gin He Tyr Gin Val Val Lys Thr 

50 55 60 

Ser Lys Met Arg Cys Tyr Phe Ser Pro Thr Leu Phe Ser Ser Glu Thr 
65 70 75 80 

Glu Thr Thr Ala Glu Thr He Ser Arg Thr Arg Ser He He Asp Glu 

85 90 95 

He Leu Lys Tyr Lys Asn Pro Lys Phe Gin Gly Leu Trp 
100 105 

(2) INFORMATION FOR SEQ ID NO: 348: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 

Met Val Arg Glu Ser Ala Glu Ser Ala Gly Phe Phe Leu Glu Thr His 

1 5 io is 

Met Val Gin Gly Glu Trp Asn Thr Cys Val Phe Lys Lys Thr Lys Asp 

20 25 30 

He Ser Gly Val He Gly Gly 
35 



(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 49 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 

Met Thr Val Asp Arg Glu Gly Phe Glu Ala Ala Met Lys Glu Gin Gin 

15 10 15 

Glu Arg Ala Arg Ala Ser Ala Val Lys Gly Gly Ser Met Gly Met Gin 

20 25 30 

Asn Gly Asn Ser Ser Lys His His Cys Arg Lys Cys Leu Gin Leu Gin 
35 40 45 

Cys 



(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

Met Lys Ala Gin Leu Val Pro Ser Arg Ser Lys Tyr Leu He Asn Asp 

15 10 15 

Asn Thr Val Val Leu Phe Phe Gly Thr Arg Thr Asp Tyr Thr Arg Lys 

20 25 30 

Tyr Asp Leu Asp Leu Val Arg Glu Val Ala Gly Asp Gin He Ala Arg 

35 40 45 

Arg Val Val Leu Leu Ser Asp Gin Ala Phe Gly Leu Glu Asn Val Lys 

50 55 60 

Glu Val Ala Leu Gly Cys Gly Gly Val Leu Asn Asp He Tyr Arg Val 
65 70 75 80 

Phe Pro Tyr He Val Tyr Ala Gin Leu Phe Ala Leu Leu Thr Ser Leu 

85 90 95 

Lys Val Glu Asn Lys Pro Asp Tnr Pro Ser Pro Thr Gly Thr Val Asn 
100 105 110 



270 



WO 97/37026 PCTAJS97/05306 

Arg Val Val Gin Gly Val He He His Glu Tyr Gin Lys 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 351: 

(X) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 



Met Ala Val Asn Asn Glu He Gly Ser He Gin Xaa He Glu Ala He 

15 10 i 5 

Ser Glu Phe Leu Ala Asp Lys Pro Thr He Ser Phe His Val Asp Ala 

20 25 30 

Val Gin Ala Leu Ala Lys He Pro Thr Glu Lys Tyr Leu Thr Glu Arg 

35 40 45 

Val Asp Cys Ala Thr Phe Ser Ser His Lys Phe His Gly Val Arg Gly 

50 55 60 

Val Gly Phe Val Tyr He Lys Ser Gly Lys Lys He Thr Pro Leu Leu 
65 70 75 80 

Thr Gly Gly Gly Gin Glu Arg Asp Tyr Arg Ser Thr Thr Glu Asn Val 

85 90 95 

Ala Gly He Ala Ala Thr Ala Lys Ala Leu Arg Leu Ser Met Glu Lys 

100 105 no 

Leu Asp He Phe Arg Ser Lys Thr Gly Gin Met Lys Ala Val He His 

115 120 125 

Gin Ala Leu Leu Asn Tyr Pro Asp He Phe Val Phe Ser Asp Glu Glu 

130 135 140 

Asp Phe Ala Pro His He Leu Thr Phe Gly He Lys Gly Val Arg Gly 
145 150 155 160 

Glu Val He Val His Ala Phe Glu Asp Tyr Asp lie Phe He Ser Thr 

165 170 175 

Thr Ser Ala Cys Ser Ser Lys Ala Gly Lys Pro Ala Gly Thr Leu He 

180 185 190 

Ala Met Gly Val Asp Lys Asp Lys Ala Lys Ser Ala Val Arg Leu Ser 

1^5 200 205 

Leu Asp Leu Glu Asn Asp Met Ser Gin Val Glu Gin Phe Leu Thr Lys 
210 215 220 
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Leu Lys Leu lie Tyr Asn Gin Thr Arg Lys Val Arg 
225 230 235 
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(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:352: 

Met Leu Val Thr Gly Ala lie Leu Gly Val Asn Val His lie Phe Ser 

15 10 15 

Pro Lys Glu Leu Phe Pro Glu Lys Glu lie Val Glu Leu Ala Glu Gly 

20 25 30 

Phe Ala Lys Glu Ser Gly Ala His Val Leu He Thr Glu Asp Ala Asp 

35 40 45 

Glu Ala Val Lys Asp Ala Asp Val Leu Tyr Thr Asp Val Trp Val Ser 

50 55 60 

Met Gly Glu Glu Asp Lys Phe Ala Glu Arg Val Ala Leu Leu Lys Pro 
65 70 75 80 

Tyr Gin Val Asn Met Asp Leu Val Lys Lys Ala Gly Asn Glu Asn Leu 

85 90 95 

He Phe Leu His Cys Leu Pro Ala Phe His Asp Thr His Thr Val Tyr 

100 105 110 

Gly Lys Asp Val Ala Glu Lys Phe Gly Val Glu Glu Met Glu Val Thr 

115 120 125 

Asp Glu Val Phe Arg Ser Lys Tyr Ala Arg His Phe Asp Gin Ala Glu 

130 135 140 

Asn Arg Met His Thr He Lys Ala Val Met Ala Ala Thr Leu Gly Asn 
145 150 155 160 

Leu Tyr He Pro Lys Val 
165 

(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 53: 

Met Arg Leu Glu Gin Asp Cys Pro Val Phe Leu Lys He Lys Glu Lys 

15 10 15 

Asp Met Ala Ser Lys Met Leu His Thr Cys Leu Arg Val Glu Asn Leu 

20 25 3 0 

Glu Lys Ser He Ala Phe Tyr Gin Asp Ala Phe Gly Phe Lys Glu Leu 

35 40 45 

Arg Arg Arg Asp Phe Pro Asp His Ala Phe Thr He Val Tyr Leu Gly 

50 55 60 

Leu Glu Gly Asp Asp Tyr Glu Leu Glu Leu Thr Tyr Asn Tyr Asp His 
65 70 75 80 

Gly Pro Tyr Val Val Gly Asp Gly Phe Ala His lie Ala Leu Ser Thr 

85 90 95 

Pro Asp Leu Glu Ala Leu His Gin Glu His Ser Thr Lys Gly Tyr Glu 

100 105 110 

Val Thr Glu Pro Asn Gly Leu Pro Gly Thr Ala Pro Asn Tyr Tyr Phe 

115 120 125 

Val Lys Asp Pro Asp Gly Tyr Lys Val Glu Val He Arg Glu Lys 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 354: 

Met Asp His Thr He Phe Cys Val Gly Tyr Arg Val Met Gin Lys Asp 

1 5 io 15 

Leu Glu Gly Thr Leu Asp Ala Glu Lys Leu Lys Ala Ala Gly Val Pro 

20 25 30 

Phe Gly Pro Leu Phe Gly Lys He Lys Asn Gly Gin Asp Leu Val Leu 
35 40 45 
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Glu Asp Gly Thr Glu He Lys Ala Ala Asp Tyr He Ser Ala Pro Arg 

50 55 60 

Pro Gly Lys He He Thr He Leu Gly Asp Thr Arg Lys Thr Asp Ala 
65 70 75 80 

Ser Val Arg Leu Ala Val Asn Ala Asp Val Leu Val His Glu Ser Thr 

85 90 95 

Tyr Gly Lys Gly Asp Glu Lys He Ala Arg Asn His Gly His Ser Thr 

100 105 HO 

Asn Met Gin Ala Ala Gin Val Ala Val Glu Ala Gly Ala Lys Arg Leu 

115 120 125 

Leu Leu Asn His He Ser Ala Arg Phe Leu Ser Lys Asp He Ser Lys 

130 135 140 

Leu Lys Lys Asp Ala Ala Thr He Phe Glu Asn Val His Val Val Lys 
145 150 155 160 

Asp Leu Glu Lys Met Glu He Tyr Gin Ser Gin Lys Gly 



(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

{C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 

Met He Gin Pro Ala Ser Leu Glu Glu Leu Ala Ser Leu Val Glu Lys 

15 10 15 

Ala Gly Lys Lys Val Phe Leu Phe Val Ala Asp Trp Cys Gly Asp Cys 

20 25 30 

Arg Tyr lie Tyr Pro Ala Leu Pro Glu He Glu Glu Thr Asn Pro Glu 

35 40 45 

Phe Thr Phe He Arg Met Asp Arg Asp Gin Tyr Met Asp Leu Ala Lys 

50 55 60 

Leu Trp Asp Val Tyr Gly He Pro Ser Leu Val Val Leu Glu Lys Asp 
65 70 75 80 

Lys Glu He Gly Arg Phe Val Asn Arg Asp Arg Lys Ser Lys Glu Gin 

85 90 95 

He Asn Asp Phe Leu Ala Gly Leu Lys 



165 



170 



100 



105 
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(2) INFORMATION FOR SEQ ID NO: 356: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 

Met Arg Leu Gly Leu Leu Gin Val Leu Arg Leu Pro Lys Ala Phe Gin 

15 10 is 

Leu He Phe He Gin Asp Lys Gly His Gly Asp Val Ser Ser Phe Thr 

20 25 30 

Ala Ser Cys Val Thr Gly Pro Trp Ala Phe Lys Leu Val Gly Lys Gin 

35 40 45 

Gly Asn He His 
50 

(2) INFORMATION FOR SEQ ID NO: 357 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:357: 

Met Lys Thr Arg Lys He Pro Leu Arg Lys Ser Val Val Ser Asn Glu 

15 io 15 

Val He Asp Lys Arg Asp Leu Leu Arg lie Val Lys Asn Lys Glu Gly 

20 25 30 

Gin Val Phe He Asp Pro Thr Gly Lys Ala Asn Gly Arg Gly Ala Tyr 

35 40 45 

He Lys Leu Asp Asn Ala Glu Ala Leu Glu Ala Lys Lys Lys Lys Val 

50 55 60 

Phe Asn Arg Ser Phe Scr Met Glu Val Glu Glu Ser Phe Tyr Asp Glu 
65 70 7 5 80 
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Leu lie Ala Tyr Val Asp His Lys Val Lys Arg Arg Glu Leu Gly Leu 
85 90 95 

Glu 



(2) INFORMATION FOR SEQ ID NO: 3 58: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 



Met Arg Val Ala Tyr Phe Lys Val His His Pro lie Tyr Tyr Tyr Cys 

15 10 15 

Ala Tyr Phe Ser lie Arg Ala Lys Ala Phe Asp lie Lys Thr Met Gly 

20 25 30 

Ala Gly Leu Glu Val lie Lys Arg Arg Met Glu Glu He Ser Glu Lys 

35 40 45 

Arg Lys Asn Asn Glu Ala Ser Asn Val Glu He Asp Leu Tyr Thr Thr 

50 55 60 

Leu Glu He Val Asn Glu Met Trp Glu Arg Gly Phe Lys Phe Gly Lys 
65 70 75 80 

Leu Asp Leu Tyr Cys Ser Gin Thr Thr Glu Phe Leu He Asp Gly Asp 

85 90 95 

Thr Leu He Pro Pro Phe Val Ala Met Asp Gly Leu Gly Glu Asn Val 

100 105 110 

Ala Lys Gin Leu Val Arg Ala Arg Glu Glu Gly Glu Phe Leu Ser Lys 

115 120 125 

Thr Glu Leu Arg Lys Arg Gly Gly Leu Ser Ser Thr Leu Val Glu Lys 

130 135 140 

Met Asp Glu Met Gly He Leu Gly Asn Met Pro Glu He Thr Ser 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 359: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 



Met Ser Ser Lys lie Ser lie Gly Gin Leu lie Thr Phe Asn Thr Leu 

15 10 l5 

Leu Ser Tyr Phe Thr Thr Pro Met Glu Asn He He Asn Leu Gin Thr 

20 25 30 

Lys Leu Gin Ser Ala Lys Val Ala Asn Asn Arg Leu Asn Glu Val Tyr 

35 40 45 

Leu Val Glu Ser Glu Phe Gin Val Gin Glu Asn Pro Val His Ser His 

50 55 60 

Phe Leu Met Gly Asp He Glu Phe Asp Asp Leu Ser Tyr Lys Tyr Gly 
65 70 75 80 

Phe Gly Arg Asp Thr Leu Thr Asp He Asn Leu Thr He Lys Gin Gly 

85 90 95 

Asp Lys Val Ser Leu Val Gly Val Ser Gly Ser Gly Lys Thr Thr Leu 

100 105 HO 

Ala Lys Met He Val Asn Phe Phe Glu Pro Tyr Lys Gly His He Ser 

115 120 125 

He Asn His Gin Asp He Lys Asn He Asp Lys Lys Ser Leu Ala Pro 

130 135 140 

Ser Tyr 
145 



(2) INFORMATION FOR SEQ ID NO: 3 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 



Met Arg Gin Ala Pro Ala Leu Ala Gin Ala Asn lie Glu Arg Val Val 

1 5 10 15 

Val His Lys lie Ser Lys Val Trp Glu Phe His Phe Arg He Phe 
20 25 30 
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(2) INFORMATION FOR SEQ ID NO: 361: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 

Met Gin Thr Gly Trp Gin Tyr Leu Gly Asn Lys Trp Tyr Tyr Leu Arg 

15 10 15 

Ser Ser Gly Ala Met Ala Thr Gly Trp Tyr Gin Glu Gly Thr Thr Trp 

20 25 30 

Tyr Tyr Leu Asp Gin Pro Asn Gly Asp Met Lys Thr Gly Trp Gin Asn 

35 40 45 

Leu Gly Asn Lys Trp Tyr Tyr Leu Arg Ser Ser Gly Ala Met Ala Thr 

50 55 60 

Gly Trp Tyr Gin Glu Gly Thr Thr Trp Tyr Tyr Leu Asp Gin Pro Asn 
65 70 75 80 

Gly Asp Met Lys Thr Gly Trp Gin Asn Leu Gly Asn Lys Trp Tyr Tyr 

85 90 95 

Leu Arg Ser Ser Gly Ala Met Ala Thr Gly Trp Tyr Gin Asp Gly Ser 

100 105 110 

Thr Trp Tyr Tyr Leu Asn Ala Val Met Glu lie 
115 120 

(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362 : 

Met Lys Trp Gly Ser Arg lie Leu Leu Gly Leu Thr Pro Lys Ser Leu 

1 5 10 15 
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Arg Tyr Arg He Trp Lys Lys Ala Glu Lys Glu Met Thr Lys Tyr Asp 

20 25 30 

Leu Ala Asp Cys Asp Gly He Thr Glu Leu Cys Ser Gly Pro Gly Tyr 

35 40 45 

Met Arg Asn Lys Tyr Pro He Thr Ser Phe Glu Asp Asn Leu Phe Leu 

50 55 60 

Pro Phe Glu Gly Thr Glu Met Pro He Pro He Gly Tyr Asp Val Tyr 
65 70 75 80 

Leu Arg Thr Ala Phe Gly Asp Tyr Met Thr Pro Pro Pro Ala Asp Lys 

85 90 95 

Gin Val Pro His Gin Asp Ala Val He Ala Asp Met Asp Lys Ser Tyr 

100 105 no 

Thr Glu Tyr Lys Gly Glu Tyr Gly Gly 
115 120 

(2) INFORMATION FOR SEQ ID NO: 363: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D). TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 



Met Phe Pro Asp Ser He Asp Thr Met His Gin Ala Asn Glu Phe He 

15 10 15 

Ala Leu Asp Asp Leu Phe Arg Ala Ala Ala He Tyr Ala Glu Ala He 

20 25 30 

Tyr Glu Leu He Lys 
35 



(2) INFORMATION FOR SEQ ID NO: 364: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 
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Met Asp Glu Leu Gin Lys Arg Asn Leu Leu Gly Phe Val Phe Gin Asp 

1 5 10 15 

Phe Gin Leu Phe Pro His Leu Ser Val Leu Glu Asn Leu Thr Leu Ser 

20 25 30 

Pro Val Lys Thr Met Gly Met Lys Gin Glu Glu Ala Glu Lys Lys Ala 

35 40 45 

Ser Gly Leu Leu Glu Gin Leu Gly Leu Gly Gly His Ala Glu Ser Tyr 

50 55 60 

Pro Phe Ser Leu Ser Gly Gly Gin Lys Gin Arg Val Ala Leu Ala Arg 
65 70 75 80 

Ala Met Met He Asp Pro Glu He He Gly Tyr Asp Glu Pro Thr Ser 

85 90 95 

Ala Leu Asp Pro Glu Leu Arg Leu Glu Val Glu Lys Leu He Leu Gin 

100 105 HO 

Asn Arg Glu Leu Gly Met Thr His He Val Val Thr His Asp Leu Gin 
115 120 125 

Phe Gly 
130 

(2) INFORMATION FOR SEQ ID NO: 3 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 65: 

Met Tyr Gly Ala Asn Ser Glu Arg Leu Lys Glu Ser Leu Arg He Ser 

15 10 15 

Leu Ser Pro Gin Asn Thr Val Glu Asp Leu Gin Thr Leu Ala Lys Thr 

20 2 5 30 

Leu Lys Glu lie He Gly Gly 
35 

(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 41 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D> TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 



Met He Ala Arg Ser Tyr Gin Glu Met Asp Asp Leu Asp Thr Ala Tyr 

15 10 15 

Glu His Tyr Gin Glu Leu Thr Gly Asp Leu Lys Asp Asn Pro Glu Phe 

20 25 30 

Leu Glu His Tyr He Tyr Leu Leu Arg 
35 40 



(2) INFORMATION FOR SEQ ID NO: 367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 



Met Lys His Phe Asp Thr He Val He Gly Gly Gly Pro Ala Gly Met 

1 5 io 15 

Met Ala Thr He Ser Ser Ser Phe Tyr Gly Gin Lys Thr Leu Leu He 

20 25 30 

Glu Lys Asn Arg Lys Leu Gly Lys Lys Leu Ala Gly Thr Gly Gly Gly 

35 40 45 

Arg Cys Asn Val Thr Asn Asn Gly Ser Leu Asp Asn Leu Leu Ala Gly 

50 55 60 

lie Pro Gly Asn Gly Arg Phe Leu Tyr Ser Val Phe Ser Gin Phe Asp 
65 70 75 so 

Asn His Asp He He Asn Phe Phe Thr Glu Asn Gly Val Lys Leu Lys 

85 90 95 

Val Glu Asp His Gly Arg Val Phe Pro Ala Ser Asp Lys Ser Arg Thr 

100 105 no 

He He Glu Ala Leu Glu Lys Lys He Thr Glu Leu Gly Gly Gin Val 
115 120 125 
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Ala Thr Pro Asn Arg Asn Arg Phe Cys 
130 135 
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<2) INFORMATION FOR SEQ ID NO: 368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 68 : 

Met Thr Leu Ala Lys Asp lie Ala Ser His Leu Leu Lys lie Gin Ala 

15 10 15 

Val Tyr Leu Lys Pro Glu Glu Pro Phe Thr Trp Ala Ser Gly lie Lys 

20 25 30 

Ser Pro lie Tyr Thr Asp Asn Arg Val Thr Leu Ala Tyr Pro Glu Thr 

35 40 45 

Arg Thr Leu lie Glu Asn Gly Phe Val Glu Ala He Lys Glu Ala Phe 

50 55 60 

Pro Glu Val Glu Val He Ala Gly Thr Ala Thr Ala Gly He Pro His 
65 70 75 80 

Gly Ala He He Ala Asp Lys Met Asp Leu Pro Phe Ala Tyr He Pro 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 

Met Val Lys Val Leu Ala Ala Cys Gly Asn Gly Met Gly Ser Ser Met 

15 10 15 

Val He Lys Met Lys Val Glu Asn Ala Leu Arg Lys Leu Asn Gin Thr 
20 25 30 
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Asp Phe Thr Val Asn Ser Cys Ser Val Gly Glu Ala Lys Gly Leu Ala 

35 40 45 

Val Gly Tyr Asp lie Val He Ala Ser Leu His Leu He Gin Glu Leu 

50 55 60 

Glu Gly Arg Thr Asn Gly Lys Leu He Gly Leu Asp Asn Leu Met Asp 
65 70 75 so 

Asp Lys Glu He Thr Glu Lys Leu Ser Gin Ala He Gin 
85 90 



(2) INFORMATION FOR SEQ ID NO: 370: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 



Met Ser Met Phe Leu Asp Thr Ala Lys He Lys Val Lys Ala Gly Asn 

15 10 15 

Gly Gly Asp Gly Met Val Ala Phe Arg Arg Glu Lys Tyr Val Pro Asn 

20 25 30 

Gly Gly Pro Trp Gly Gly Asp Gly Gly Arg Gly Gly Asn Val Val Phe 

35 40 45 

Val Val Asp Glu Gly Leu Arg Thr Leu Met Asp Phe Arg Tyr Asn Arg 

50 55 60 

His Phe Lys Ala Asp Ser Gly Glu Lys Gly Met Thr Lys Gly Met His 
65 7 0 75 80 

Gly Arg Gly Ala Glu Asp Leu Arg Val Arg Val Pro Gin Gly Thr Thr 

85 90 95 

Val Arg Asp Ala Glu Thr Gly Lys Val Leu Thr Asp Leu He Glu His 

100 105 HO 

Gly Gin Glu Phe He Val Ala His Gly Gly Pro Trp Trp Thr Trp Lys 

115 120 125 

Tyr Ser Phe Arg Asp Thr Lys Lys Ser Cys Thr Gly Asn Leu 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 371: 

{i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 210 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 



Met Met Lys Gly Glu Met Thr Phe Lys Gin Val His Tyr Lys Tyr Gly 

15 10 is 

Tyr Gly Arg Asp Val Leu Ser Asp lie Asn Leu Thr Val Pro Gin Gly 

20 25 30 

Ser Lys Val Ala Phe Val Gly lie Ser Gly Ser Gly Lys Thr Thr Leu 

35 40 45 

Ala Lys Met Met Val Asn Phe Tyr Asp Pro Ser Gin Gly Glu lie Ser 

50 55 60 

Leu Gly Gly Val Asn Leu Asn Gin lie Asp Lys Lys Ala Leu Arg Gin 
65 70 75 80 

Tyr lie lie Tyr Leu Pro Pro Gin Pro Tyr Val Phe Asn Gly Thr lie 

85 90 95 

Leu Glu Asn Leu Leu Leu Gly Ala Lys Gly Gly Asp Asp Thr Gly Lys 

100 105 110 

He Ser Leu Thr Gly Arg Ser Glu Phe Gly Ser Glu He Pro Lys Arg 

115 120 125 

Asn He Ser Lys Pro Pro Cys His Leu Glu He Tyr Gin Thr Glu Leu 

130 135 140 

Thr Ser Asp Gly Ala Gly He Ser Gly Gly Gin Arg Gin Arg He Ala 
145 150 155 160 

Leu Ala Arg Ala Leu Leu Thr Asp Ala Pro Val Leu He Leu Asp Glu 

165 170 175 

Ala Thr Asn Ser Leu Asp He Leu Thr Lys Lys Arg He Val His Tyr 

180 185 190 

Leu Met Ala Leu Asp Lys Thr Phe Asp Phe His Cys Ser Pro Leu Asp 
195 200 205 

Tyr Cys 
210 



(2) INFORMATION FOR SEQ ID NO: 372: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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fii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 



Met Ala Arg Phe lie Glu lie Asp 

1 5 
Ser Gin lie Ser His Phe Pro His 
20 

Gin Thr Ala Val Tyr Ala Gin Glu 

35 40 
Gly Arg Trp Phe Ser Arg Tyr Asp 

50 55 
Asn Val Asp Leu His Ser Leu Val 
65 70 



Ala Lys Glu His Asp Arg Val Thr 

10 15 
lie Leu Ala Ser Ser Leu Met Glu 
25 30 
His Glu Asn Gly Lys Ala Leu Cys 
45 

Pro Asn Cys Gly Lys Arg Ala Arg 
60 

Gin 



(2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 149 amino acids 
{B} TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 



Met Ala His Gin Gly Gin Asp He Tyr Asp Phe Pro Arg Ala Met He 

1 5 10 15 

Lys Glu Asp Asn Leu Glu Phe Ser Phe Ser Gly Leu Lys Ser Ala Phe 

20 25 30 

He Asn Leu His His Asn Ala Glu Gin Lys Gly Glu Ser Leu Ser Thr 

35 40 45 

Glu Asp Leu Cys Ala Ser Phe Gin Ala Ala Val Leu Asp He Leu Met 

50 55 60 

Ala Lys Thr Lys Lys Ala Leu Glu Lys Tyr Pro Val Lys Thr Leu Val 
65 70 75 80 

Val Ala Gly Gly Val Ala Ala Asn Lys Gly Leu Arg Glu Arg Leu Ala 

35 90 95 

Ala Glu Val Thr Asp Val Lys v-U He He Pro Pro Leu Arg Leu Cys 

100 105 no 

Gly Asp Asn Ala Gly Met He Ala Tyr Ala Ser Val Ser Glu Trp Asn 
115 120 125 
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Lys Glu Asn Phe Ala Asn Leu Asp Leu Asn Ala Lys Pro Ser Leu Ala 

130 135 140 

Phe Asp Thr Met Glu 
145 



(2) INFORMATION FOR SEQ ID NO: 374: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 



Met Lys Arg Ser Leu Asp Ser Arg 

1 5 
Val Phe Phe Leu Leu Val He Gly 
20 

Ser His Asp Tyr Pro Asn Asn He 

35 40 
Ala Trp He Ala Leu Gly Leu Val 

50 55 
Asn Thr Glu Phe Leu Trp Lys Val 
65 70 
Leu Gly Thr Tyr Asp Leu Ala Asp 
85 



Val Asp Tyr Ser Leu Leu Leu Pro 

10 15 
Val Val Ala He Tyr He Ala Val 
25 30 
Leu Pro He Leu Gly Gin Gin Val 
45 

He Gly Phe Val Val Met Leu Phe 
60 

Thr Pro Phe Leu Tyr He Phe Arg 

75 80 
Cys He Leu 
90 



(2) INFORMATION FOR SEQ ID NO: 375: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C> STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 



Met Pro Asp Asp He He Asp Ser Phe Trp Tyr He He Asp His Phe 
15 10 15 



286 



WO 97/37026 PCT/US97V05306 

Leu Lys Asn Val Phe Glu Leu Glu Glu Glu Leu Glu Phe Gin Leu Leu 

20 25 30 

Asn Asn Gin Gly Lys lie Thr Phe His Phe Ser Ser Gin His Leu Pro 

35 40 45 

Thr Ala lie Asp Phe Asp Phe Asn His Pro Phe Asp Pro Arg Tyr Pro 

50 55 60 

Pro Arg Val Leu Val Leu Asp Met Asp Gly Arg Glu Thr lie Leu Leu 
65 70 75 80 

Pro Glu Glu Asn Asp Leu Phe 
85 



(2) INFORMATION FOR SEQ ID NO: 376: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 



Met Gly Lys Lys Asp Ala Ser Ala Met His Glu Met Arg Ala Ser Phe 

15 10 15 

He Gin Gly Ser He Glu Ala Gly His Thr Ala Glu Lys Ser Glu Gin 

20 25 30 

Val Phe Asp Val Met Glu Lys Phe Ala Gly Tyr Gly Phe Asn Arg Ser 

35 40 45 

His Ala Tyr Ala Tyr Ser Ala Leu Ala Phe Gin Leu Ala Tyr Phe Lys 

50 55 60 

Thr His Tyr Pro Ala He Phe Tyr Gin Val Met Leu Asn Tyr Ser Asn 
65 70 75 80 

Ser Asp Tyr Leu He Asp Ala Leu Glu Ala Gly Phe Glu Val Ala Ser 

85 90 95 

Leu Ser He Asn Thr He Pro Tyr His Asp Lys He Ala Asn Lys Ser 

100 105 no 

He Tyr He Gly Leu Lys Ser He Lys Gly Leu Gin Gin Gly Leu Gly 

115 120 125 

Ala Leu Asp Tyr 
130 



(2) INFORMATION FOR SEQ ID NO: 377: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 



Met Asp Lys Lys Gin Asn Leu Thr 

1 5 
Glu Leu Asn Gin lie Thr Gly Gly 
20 

Glu Thr lie Leu Ser Lys Phe Lys 

35 40 
lie Gin Leu 
50 



Ser Phe Gin Glu Leu Thr Thr Thr 

10 15 
Glu Trp Trp Glu Glu Leu Leu His 
25 30 
lie Thr Lys Ala Leu Glu Leu Pro 
45 



(2) INFORMATION FOR SEQ ID NO: 378: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 



Met His Glu Met Phe Thr Ala lie 

1 5 
Ala Ala lie Gin Arg Lys Phe lie 
20 

Tyr Pro Ser Pro Ala Pro Val Lys 

35 40 
Glu Ala Gly Pro Thr Arg Leu Pro 

50 55 
Val Lys Arg lie lie Lys Val Val 
65 70 
Lys Ala Thr Val Thr Gly Val Leu 
85 



Ala Glu Ser Asp Met Lys Lys Ala 

10 15 
Pro Lys Val Asn Ala Leu Phe Ser 
25 30 
Ala lie Leu Asn Tyr Met Gly Phe 
45 

Leu Val Pro Ala Pro Glu Glu Asp 
60 

Val Asp Gly Asp Tyr Glu Ala Thr 

75 80 
Arg Pro Asp Tyr 
90 



288 



WO 97/37026 

(2) INFORMATION FOR SEQ ID NO: 379: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 



Met Leu Asn Glu Phe Pro lie Phe 

1 5 
Pro Asn Lys Cys Val lie Lys Ser 
20 

Thr Leu Gly Asn His Thr Phe Lys 

35 40 
Gin Thr lie Leu Asp Glu Asn Val 

50 55 
Tyr Leu Tyr Thr Tyr Ala Pro Phe 
65 70 



Asp Tyr Glu Asp He Gin Leu He 

10 i 5 
Arg Ala Glu Ala Asp Thr Ser Val 
25 30 
Leu Pro Val Val Pro Ala Asn Met 
45 

Ala Glu Gin Leu Ala Lys Gly Gly 
60 



(2) INFORMATION FOR SEQ ID NO: 380: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 3 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ing 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 380: 

Met Asp He Lys Leu Lys Asp Phe Glu Gly Pro Leu Asp Leu Leu Leu 

1 5 io 15 

His Leu Val Ser Lys Tyr Gin Met Asp He Tyr Asp Val Pro He Thr 

20 25 30 

Glu Val He Glu Gin Tyr Leu Ala Tyr Val Ser Thr Leu Gin Ala Met 

35 40 45 

Arg Leu Glu Val Thr Gly Glu Tyr Met Val Met Ala Ser Gin Leu Met 
50 55 60 
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Leu He Lys Ser Arg Lys Leu Leu Pro Lys Val Ala Glu Val Thr Asp 
65 70 75 80 

Leu Gly Asp Asp Leu Glu Gin Asp Leu Leu Ser Gin He Glu Glu Tyr 

85 90 95 

Arg Lys Phe Lys Leu Leu Gly Glu His Leu Glu Ala Lys His Gin Glu 

100 105 no 

Thr Gly Pro Val Leu Phe Gin Ser Ala Asp Arg Val Asp Leu Arg Arg 

115 120 125 

Cys Gly Ala Cys Ala 
130 

(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 

Met Gin Ser Thr Glu Lys Lys Pro Leu Thr Ala Phe Thr Xaa He Ser 

15 10 15 

Thr He He Leu Leu Leu Leu Thr Val Leu Xaa He Phe Pro Phe Tyr 

20 25 30 

Trp He Leu Thr Gly Ala Phe Lys Ser Gin Pro Asp Thr He Val He 

35 40 45 

Pro Pro Gin Trp Phe Pro Lys Met Pro Thr Met Glu Asn Phe Gin Gin 

50 55 60 

Leu Met Val Gin Asn Pro Ala Leu Gin Trp Met Trp Asn Ser Val Phe 
65 70 75 80 

He Ser Leu Val Thr Met Phe Leu Val Cys Ala Thr Ser Ser Leu Ala 

85 90 95 

Gly Tyr Val Leu Ala Xaa Lys Arg Phe Tyr Gly Xaa Arg He Tyr Cys 
100 105 110 

(2) INFORMATION FOR SEQ ID HO: 3 82: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 



Met Leu Phe Met Arg Asp Ser Leu Asp Ser He Val Glu Pro Val Leu 

15 10 15 

Asp Glu Met Gly Arg Phe Tyr Asp Trp Thr Glu Glu Glu Lys Ala Thr 

20 25 3 0 

Tyr Arg Ala Asp Val Lys Ala Ala Leu Ala Gin Asn Asp Leu Ala Glu 

35 40 45 

Leu Lys Asn 
50 

(2) INFORMATION FOR SEQ ID NO:383: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:383: 

Met Lys He Lys Val Val Thr Val Gly Lys Leu Lys Glu Lys Tyr Leu 

Lys Asp Gly He Ala Glu Tyr Ser Lys Arg He Ser Arg Phe Ala Lys 

20 25 30 

Phe Glu Met He Glu Leu Ser Asp Glu Lys Thr Pro Asp Lys Ala Ser 

35 40 45 

Glu Ser Glu Asn Gin Lys He Leu Glu He Glu Gly Gin Arg lie Leu 

50 55 60 

Ser Lys He Ala Asp Arg Asp Phe Val He Val Leu Ala lie Glu Gly 
65 7 ° 75 80 

Lys Thr Phe Phe Ser Glu Glu Phe Ser Lys Gin Leu Glu Glu Thr Ser 

85 90 95 

He Lys Gly Xaa Ser Thr Leu Thr Phe lie lie Gly Gly Ser Leu Gly 

100 105 no 

Leu Ser Ser Ser Val Lys Asn Arg Ala Asn Leu Ser Val Ser Phe Gly 

120 125 
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Arg Leu Thr Leu Pro His Gin Leu Met Arg Leu Val Leu Val Glu Gin 

130 135 140 

lie Tyr Arg Ala Phe Thr lie Gin Gin Gly Phe Pro Tyr His Lys 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 3 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 



Met Gly lie Leu Ala Tip Cys Asp 

1 5 
Ser Phe Met Asp Leu Gly Leu Phe 
20 

Lys Lys Thr Leu Asn Lys Glu Arg 

35 40 
Pro Glu Asn Arg Trp lie Gin Leu 

50 55 
Val Val Met Leu Ser Phe Leu lie 
65 70 
Trp Phe Lys Lys 



Leu Val Leu Tyr Arg Leu Trp Asp 

10 15 
lie Asn Asp Ala Trp Val Arg Lys 
25 30 
Lys Lys Ala Gly Lys Ala Ala Leu 
45 

Leu Gly Met Val Val Thr Phe His 
60 

Phe Ser Gly Phe Leu Asn Asn Leu 
75 80 



(2) INFORMATION FOR SEQ ID NO: 385: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

Met Val Met Ala Glu Asp Gin Ala Val Arg Gin Asn Arg Leu Ala lie 
15 10 15 
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Leu Ser Gin Leu Thr Lys Lys Ala Ala Lys Phe Ala Cys Phe Asn Gin 

20 25 30 

lie Asn Thr Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:386: 

Met Gly Ala Gin Pro Val Gin Asp Thr Glu Thr Ser Ser Ala Leu He 

15 10 15 

Ser Ser His Tyr Leu Asp Glu Gin Asp Leu Ser Glu Lys Leu Lys Ser 

20 25 30 

Glu Leu Gin Trp Phe Glu Leu Glu Asn Lys Leu Leu Asn Leu Trp Glu 
35 40 45 

His 



(2) INFORMATION FOR SEQ ID NO: 387 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

Met Asp Glu Val Thr lie Phe Gly He Asn Tyr Phe Lys Glu His Tyr 

15 10 15 

Pro Glu Lys Leu Ala Glu Arg Phe Lys Gin Met Lys lie Glu Glu Glu 

20 25 3 o 

Ala Pro Val He He Met Asp Met Thr Arg Ala Leu Gly Phe Arg Asp 
3 5 40 45 
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Asp Tyr Asp Arg Phe Tyr Ser Leu Phe Arg Glu Gly Ser Pro 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 388: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

Met Lys Val lie Asn Gin Thr Leu Leu Glu Lys Val He He Glu Arg 

15 10 15 

Ser Arg Ser Ser His Lys Gly Asp Tyr Gly Xaa Leu Leu Leu Leu Gly 

20 25 30 

Gly Thr Tyr Pro Tyr Gly Val Xaa Ser Ser Trp Leu Leu 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 3 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

Met Asn Trp He Leu Leu Gin Gly Leu He Cys Gin Arg Asp Ala Ser 

15 10 15 

Tyr Asp Met Lys Gin Asp Asp Leu Asp Lys Val Ala Asp Tyr Leu Phe 

20 25 30 

Lys Thr Glu Glu Trp Thr Met Tyr Glu Leu He Leu Phe Gly Asn Leu 

35 40 45 

Tyr Ser Phe Tyr Asp Val Asp Tyr Val Thr Arg He Gly Arg Glu Val 

50 55 60 

Met Glu Arg Glu Glu Phe Tyr Gin Glu He Ser Arg His Lys Arg Leu 
65 70 75 80 
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Val Leu lie Leu Ala Leu Asn Cys Tyr Gin His Cys Leu Glu His Ser 

85 90 95 

Ser Phe Tyr Asn Ala Asn Tyr Phe Glu Ala Tyr Thr Glu Lys lie lie 

100 105 no 

Asp Lys Gly lie Lys Leu Tyr Glu Arg Asn Val Phe His Tyr Leu Lys 

H5 120 125 

Gly Phe Ala Leu Tyr Gin Lys Gly Gin Cys Lys Glu Gly Cys Lys Gin 

130 135 140 

Met Gin Glu Thr Met His lie Phe Asp Val Leu Gly Leu Pro Glu Gin 
145 150 155 i 60 

Val Ser Leu Leu Ser Gly Thr Leu Arg Lys lie Cys Gin Lys Leu He 

165 170 175 

Phe Pro Asn Lys Gly Lys Asn Lys Lys Leu Leu Ser Val Leu He Gin 
180 185 190 



(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

Met Ser Thr Val Leu Ser Trp Thr Ala Tyr Lys Thr Gin Arg Leu Glu 

1 5 io is 

Met Ser He Leu Leu His Met He Val Asn Gly He Ala Phe Cys Leu 

20 25 30 

Leu Ala Leu Val Val He Met Ser Arg Thr Leu Gly He Ser Val 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:391: 



Met Arg Glu Asn Asp Leu Leu Leu 

1 5 
Pro Thr Tyr Ala Gly Thr Asp His 
20 

Ala Tyr Ser Pro Ala Phe Lys Gly 

35 40 
Phe Ala Asp He Ser Ala Thr Val 

50 55 
Ala Met He Gly Glu Ser Phe Leu 
65 70 



He Thr Ala Asp His Gly Asn Asp 

10 15 
Thr Arg Glu Tyr He Pro Leu Leu 
25 30 
Asn Gly Leu He Pro Val Gly His 
45 

Ala Asp Asn Phe Gly Val Glu Thr 
60 

Asp Lys Leu Val 

75 



(2) INFORMATION FOR SEQ ID NO: 392: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392: 



Met Lys Lys He Phe Leu Thr Leu Leu Thr Val Ser Leu Leu Gly Gly 

15 10 15 

Ala Ser Thr Ala Val Ala Gin Asp Phe Thr He Ala Ala Lys His Ala 

20 25 30 

He Ala Val Glu Ala Asn Thr Gly Lys He Leu Tyr Glu Lys Asp Ala 

35 40 45 

Thr Pro Pro Val Glu He Ala Ser He Thr Lys Leu He Thr Val Tyr 

50 55 60 

Leu Val Tyr Glu Ala Leu Glu Asn Gly Ser He Thr Leu Ser Thr Pro 
65 70 75 80 

Val Asp He Ser Asp Tyr Pro Tyr Gin Leu Thr Thr Asn Ser Glu Ala 

85 90 95 

Ser Asn He Pro Met Glu Ala Arg Asn Tyr Thr Val Glu Glu Leu Leu 

100 105 110 

Glu Ala Thr Leu Val Ser Ser Ala Asn Ser Ala Ala He Ala Leu Ala 

115 12C 125 

Glu Lys He Ala Gly Ser Glu Lys Asp Phe Val Xaa Met Met Arg Ala 
130 135 140 
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Lys Leu Leu Glu Trp Gly He Arg He His Cys Cys Gin Tyr Asp Arg 
150 155 160 

Ser 



(2) INFORMATION FOR SEQ ID NO:393: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 

Met Ser Asn Ala Met Glu Gly Val Leu Tyr Phe Leu Lys Pro Asp Phe 

15 io 15 

Ser Lys Leu Thr Ser Ala Asp Leu Leu Tyr Ala Leu Gly Gin Ser Phe 

20 25 30 

Phe Ala Leu Ser Leu Gly Val Thr Asp Met Leu Thr Tyr Ala Ser Tyr 

35 40 45 

Leu Asp Lys Lys Thr Asn Leu Val Gin Ser Gly He Ser He Val Thr 

50 55 60 

Met Asn He Ser He Val His His Gly Arg Ser Ser His Phe Pro Ser 
65 70 75 80 

His Val Ser Leu Gin Tyr Pro Leu 
85 

(2) INFORMATION FOR SEQ ID NO: 394: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

!C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:394: 

Met Lys Val Ala Lys Pro Phe Trp Ala He Val Lys Leu Lys Ser Val 
1 5 io 15 
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Gin His Ala Ser Met Leu Glu Asn 
20 

Lys Gin Val Glu Asn Leu Ala Leu 

35 40 
Lys Glu Asn Lys Ala Phe Glu Gin 

50 55 
Gin Thr Phe Leu Thr lie Phe Leu 
S5 70 
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Pro Lys Glu Met Asp Gly Leu Met 
25 30 
Glu Asn Gin Gly Tyr Gin Val Glu 
45 

lie Lys Asp Ser Val Ala Thr Phe 
60 

Tyr Gly Cys 
75 



(2) INFORMATION FOR SEQ ID NO: 395 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 



Met Asp Ala Glu Val Ser Lys Asn Leu Arg Leu lie Leu Glu Arg Lys 

15 10 15 

Gly Met Thr lie Leu Thr Gly Thr Lys Leu Gin Glu lie lie Glu Glu 

20 25 30 

Asn Gly Gin Leu Arg lie Lys Val Glu Gly Lys Asp Asn lie lie Ala 

35 40 45 

Ser Lys Ala Leu Leu Ser lie Gly Arg Met Pro Asp Leu Glu Gly lie 

50 55 60 

Gly Glu Val Glu Phe Glu Leu Asp Arg Gly Cys lie Lys Val Asn Glu 
65 70 75 80 

Tyr Met Glu Thr Ser Val Pro Arg lie Tyr Ala Thr Arg 
85 90 

(2) INFORMATION FOR SEQ ID NO: 396: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(Di TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 



Met Lys Asn Ser lie Met Asp Thr 

1 5 
Leu Asn Lys lie lie lie Val Phe 
20 

Leu Leu Tyr He Val Val Ala Ser 

35 40 
Ser Arg Gly He Ser Phe Asn Pro 

50 55 
Gin Arg Val He Gin 
65 



Lys Phe Asp Arg Arg He Leu Leu 

10 15 
He Val Leu Met Thr Leu Leu Pro 
25 30 
Phe Met Asp Pro Lys Val Leu Val 
45 

Ala Asp Trp Thr Val Glu Gly Tyr 
60 



(2) INFORMATION FOR SEQ ID NO: 397: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 

Met Val He Pro Glu Ala Gly Tyr Leu Tyr Ala Phe His Tyr Pro Asn 

1 5 io 15 

Leu Lys Gly Lys Gly Lys Glu Ala Val Gin Val He Tyr Asn Leu Asn 

20 25 30 

Leu Ala Ser Ala Lys Val lie Gin Leu Tyr Arg Ser Leu Gly Leu Asp 

35 40 45 

Gly Lys He Gly He He Leu Asn Leu Thr Pro Ala Tyr Pro Arg Ser 

50 55 60 

Asn Ser Pro Glu Asp Leu Glu Ala Ser Arg Phe Thr Asp Asp Phe Phe 
65 70 75 80 

Asn Lys Val Phe Leu Glu Ser Ser Cys 
85 



(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 



Met Thr Arg lie Ala Ser Val Ser 

1 5 
Ala lie Ala Leu Ser Phe Leu Gly 
20 

lie Pro Asn Ala Val Leu Gly Gly 

35 40 
lie Ala Ser Asn Gly Leu Lys Val 

50 55 
Ala Gin Met Arg Asn Leu lie He 
65 70 
Leu Gly Arg Ser Tyr Pro 
85 



Val He Arg Asn Ala Ala Phe He 

10 15 
Lys Phe Thr Ala Leu He Ser Thr 
25 30 
Met Ser He Leu Leu Tyr Gly Val 
45 

Leu He Lys Glu Arg Val Asp Phe 
60 

Ala Ser Ala Met Leu Val Leu Gly 
75 80 



(2) INFORMATION FOR SEQ ID NO: 399: 



(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 12 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: 

Met Ser Leu Arg Glu Lys Ser Met 

1 5 
Asn Trp Thr Arg Val Ala Val Phe 
20 

Arg Thr Asp Phe Asp Ala Phe Val 

35 40 
Val Leu Glu Glu Asp Leu Pro Leu 

50 55 
Ser Val Thr Glu Glu Glu Arg Gin 
65 70 
Phe Val Gin Ala Arg Pro Glu Lys 
85 



SEQ ID NO: 399: 

Ser Glu Tyr Lys Leu Ser Glu Asn 

10 15 
Ala Gly Gly Asn Arg Gly His Tyr 
25 30 
Gly Val Asp Arg Gly Ser Leu Trp 
45 

Ala Leu Ala Val Gly Asp Phe Asp 
60 

Val He Gin Lys Arg Ala Gin Tyr 

75 80 
Asp Asp Thr Asp Leu Glu Leu Ala 
90 95 
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Leu Leu Thr lie Phe Glu Gin Asn Pro Gin Ala Glu Val Thr lie Phe 

100 105 110 

Gly Ala Leu Gly Gly Arg lie Asp His Met Leu Ala Asn Val Leu Ser 
115 120 125 

Thr 



(2) INFORMATION FOR SEQ ID NO:400: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION; SEQ ID NO: 400: 



Met Val Leu Gin Arg Asn Glu lie 

1 5 
Ser Thr lie Tyr Pro Thr Asp Gin 
20 

Leu Thr Glu Gin Leu Glu Thr Val 

35 40 
Asp Ser Ala Asp Asn Leu Leu Val 
50 55 



Asn Glu Lys Asp Thr Trp Asp Leu 

10 15 
Ala Trp Glu Glu Ala Leu Lys Asp 
25 30 
Ala Gin Tyr Glu Gly His Leu Leu 
45 

Asn His 



(2) INFORMATION FOR SEQ ID NO:401: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401: 



Met He Tyr Gly Ala He He Asp Ala Lys Thr Ala Glu Asn Ala Ala 

15 10 15 

Gly Met Thr Ala Met Gin Thr Ala Thr Asp Asn Ala Lys Lys Val He 

20 25 30 
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Asn Asp Leu Thr lie Gin Tyr Asn Arg Ala Gin Thr Gly Gly Tyr Tyr 

35 40 45 

Thr Arg Asn Tyr Arg Asn Arg Ser Ser Ala 
50 55 



(2) INFORMATION FOR SEQ ID NO: 402: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 



Met Thr Asp Gly Tyr Glu His Tyr Lys Lys He Met Glu Asn Pro Pro 

15 10 15 

Lys Asn Leu He Phe Pro Gly He Val Ser Pro Glu Arg Met Arg Glu 

20 25 30 

He Val Cys Ser Asn Gly Ser Phe Leu Val Ala 
35 40 



(2) INFORMATION FOR SEQ ID NO: 403: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

{C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:403: 



Met Ser Lys Gin Lys Lys Phe Glu 

1 5 
lie Val Gin Ser Leu Glu Asn Gly 
20 

Thr Ala Phe Gin Lys Gly Met Val 

35 40 
Leu Asp Lys Ala Glu Lys Thr Leu 
50 55 



Glu Asn Leu Ala Glu Leu Glu Thr 

10 15 
Glu He Ala Leu Glu Asp Ala He 
25 30 
Leu Ser Lys Glu Leu Gin Ala Thr 
45 

Val Lys Val Met Gin Glu Asp Gly 
60 
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Thr Glu Ser Asp Phe Glu 
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(2) INFORMATION FOR SEQ ID NO: 404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 

Met Phe Ala Ala Val Thr Ala Gly Tyr Gin Ala Ala Leu Met Val Pro 

15 10 15 

Thr Glu lie Leu Ala Glu Gin His Phe Glu Ser Leu Gin Asn Leu Phe 

20 25 30 

Pro Asn Leu Lys Leu Ala Leu Leu Thr Gly Ser Leu Lys Ala Ala Glu 

35 40 45 

Lys Arg Glu Val Leu Glu Thr lie Ala Lys Gly Glu Ala Asp Leu lie 

50 55 60 

He Gly Thr His Ala Leu He Gin Asp Gly Val Glu Tyr Ala Arg Leu 
65 70 75 80 

Gly Leu He He He Asp Glu Gin His Arg Phe Gly Val Gly Gin Arg 

85 90 95 

Arg He Leu Arg Glu Lys Gly Asp Asn Pro Asp Val Leu Met Met Thr 

100 105 110 

Ala Thr Pro He Pro Arg Thr Leu Ala He Thr Ala Phe Gly Asp Met 

115 120 125 

Asp Val Ser He He Asp Gin Met Pro Ala Gly Arg Lys Pro Tyr Cys 

130 135 140 

Asp Ala Leu Asp Gin Thr 
145 150 

(2) INFORMATION FOR SEQ ID NO: 405: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 110 amino acids 
(B) TYPE: amino acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:405: 

Met Asn Arg Glu Ala Leu Arg Leu Tyr Leu Val Thr Asn Arg Tyr Gin 

1 5 10 15 

Asp Ser Val Glu Ser Phe Leu Ala Lys Val Glu Thr Ala Cys Arg Ser 

20 25 30 

Gly Val Thr He Val Gin Leu Arg Glu Lys Asn Leu Thr Thr Asn Gin 

35 40 45 

Tyr Tyr Gin Leu Ala Lys Gin Val Lys Glu He Thr Asp Ala Tyr Gin 

50 55 60 

Val Pro Leu He He Asp Asp Arg Leu Asp Val Cys Leu Ala Val Asp 
65 70 75 80 

Ala Ala Gly Leu His He Gly Asp Asp Glu Leu Pro Val Ser Val Ala 

85 90 95 

Arg Gin Val Leu Gly Pro Glu Lys He Pro Arg Cys His Arg 
100 105 HO 

(2) INFORMATION FOR SEQ ID NO: 406: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 

Met Gin Gly Ser His Phe His Leu Pro He Tyr Arg Met Thr Ser Gin 

1 5 10 15 

Ala Leu Leu Asp Glu Thr Lys Lys Val Ala He Pro Val Leu Ala Thr 

20 25 30 

Thr Gin Ser Lys Asp Ser Val Asp Tyr Arg Glu Leu Pro Ser He Glu 

35 40 45 

Asn Phe Val Leu Val Met Gly Asn Glu Gly Gin Gly He Ser Pro Leu 

50 55 *0 

Met Ala Glu Ser Ala Asp Gin Leu Val His He Ser Met Lys Gly Gin 
65 70 75 80 

Ala Glu Ser Leu Asn Val Ala Val Ala Ala Gly He Leu He Phe His 
85 90 3* 

Leu Ser 
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(2) INFORMATION FOR SEQ ID NO: 407: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 



Met Thr Lys Thr Ala Phe Leu Phe Ala Gly Gin Gly Ala Gin Tyr Leu 

15 10 15 

Arg Met Gly Arg Asp Phe Tyr Asp Gin Tyr Pro lie Val Lys Glu Thr 

20 25 30 

lie Asp Arg Ala Ser Gin Val Leu Gly Tyr Asp Leu Arg Tyr Leu lie 

35 40 45 

Asp Thr Glu Glu Asp Lys Leu Asn Gin Thr Arg Tyr Thr Gin Pro Ala 

50 55 60 

lie Leu Ala Thr Ser Val Ala He Tyr Arg Leu Leu Gin Glu Lys Gly 
65 70 75 80 

Tyr His Pro Asp Met Val Ala Gly Leu Ser Leu Gly Glu Tyr Ser Ala 

85 90 95 

Leu Val Ala Ser Gly Ala Leu Asp Phe Glu Asp Ala Val Ala Leu Val 

100 105 110 

Ala Lys Arg Gly Ala Tyr Met Glu Glu Ala Ala Pro Ala Asp Ser Gly 

115 120 125 

Lys Met Val Ala Val Leu His Thr Pro 
130 135 



(2) INFORMATION FOR SEQ ID NO: 408: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE; protein 



(3Ci> SEQUENCE DESCRIPTION: .SEQ ID NO:408: 

305 



WO 97/37026 



PCI7US97/05306 



Met Tyr Gin Phe Ser Gly Gin Thr Lys Val Xaa Glu Val Leu Ala Phe 

15 10 15 

Arg Asp Lys Pro Pro Tyr Gly Gly Ser Ser Ala Met Pro Leu Arg Cys 
20 25 30 

Leu 



(2) INFORMATION FOR SEQ ID NO: 409: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 409: 

Met Thr Lys Glu Thr Gly Met Glu Gin Thr Phe Phe lie lie Lys Pro 

15 10 15 

Asp Gly Val Lys Arg Gly Leu Val Gly Glu Val Leu Lys Arg lie Glu 

20 25 30 

Gin Arg Gly Phe Thr lie Glu Lys Leu Glu Phe Arg Ser Gin Val Phe 

35 40 45 

Arg Arg Val Asp 
50 

(2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 

Met Thr Ser Gly Pro Val Leu Val Gly Val He Ser Gly Pro Lys Val 
15 10 15 
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He Glu Thr Trp Arg Thr Met Met Gly Ala Thr Arg Pro Glu Glu Ala 

20 25 30 

Leu Pro Gly Thr He Arg Gly Asp Phe Ala Lys Ala Ala Gly Glu Asn 

35 40 45 

Glu He He Gin Asn Val val His Gly Ser Asp Ser Glu Lys Asn Gin 

50 55 60 

Leu Ser Arg Glu He Ala Pro Leu Val Leu Arg Val Asp Trp Leu Asn 

65 70 75 go 

Gin Leu Asp Lys Ser Ser Phe Glu 
85 



(2) INFORMATION FOR SEQ ID NO: 411: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 115 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 



Met Gly Leu Lys Asp Asn Ala Gly Leu Val Lys Glu Tyr Ala Leu Val 

15 10 15 

Asp Ala Val Glu Tyr Gin Asn Val He Val Ala Thr Thr Val Glu Glu 

20 25 30 

Met Leu Ser Lys Tyr Ala Asn Lys Asn Asp Leu Glu He Asp Asn Ala 

35 40 45 

Thr Thr Glu Ser He Lys Gly Val Val Ala Asp Leu Lys Ser Ala Val 

50 55 60 

He Lys Gly Asp Thr Val Tyr Phe Phe Lys Val Asp Gly Lys He Tyr 
65 70 75 80 

Lys Val Lys Ala Ser Val Ser Asp Asp Leu Pro Tyr Leu Glu Asn Gly 

85 90 95 

Lys Thr Phe Glu Gly Gin Val Xaa Lys Asp Asn Tyr Leu Lys Thr Phe 
100 105 no 

Lys Val Gin 
115 



(2) INFORMATION FOR SEQ ID NO: 412: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 50 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D> TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:412: 



Met lie Ser Gly Ser Glu lie Arg 

1 5 
Leu Ala Asp Lys Thr Glu Thr Leu 
20 

Gly Leu Asp Asn He Thr Val Ala 
35 40 

Glu Glu 
50 



Asp He Val Thr Ser Asp He Pro 

10 15 
Val Arg Phe Ala Asn Asn Ala Gly 
25 30 
Leu Val Ser Met Asn Glu Glu Asp 
45 



(2) INFORMATION FOR SEQ ID NO:413: 



{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413: 



Met Pro Asp Asn Leu Ala Leu Arg Met Arg Pro Lys Thr He Asp Gin 

15 10 15 

Val lie Gly Gin Glu Xaa Leu Val Gly Pro Gly Lys He He Arg Arg 

20 25 30 

Met Val Glu Ala Asn Arg Leu Ser Ser Met He Leu Tyr Gly Pro Pro 

35 40 45 

Gly He Gly Lys Thr Ser He Ala Ser Ala He Ala Gly Thr Thr Lys 

50 55 60 

Tyr Ala Phe Arg Thr Phe Asn Ala Thr Val Asp Ser 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 414: 



(i) SEQUENCE 
(A) LENGTH 



CHARACTER I ST ICS : 
: 33 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414: 

Met Pro Glu Glu Leu Ala Tyr Leu Val Gin His Leu Asp Asn Ala Gin 

Glu Gin Val Val Leu Gly Asn Thr Tyr His Thr Gly Asn His Cys Phe 
20 25 30 

Ser 



(2) INFORMATION FOR SEQ ID NO: 415; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415: 

Met Val Phe Trp Glu Gly Leu Val Pro Thr Pro Asp Val He Asp Ala 

15 10 15 

Gly His Val Thr Gly Phe Leu Tyr Thr Val His Ser Ser Ser Thr Gly 

20 25 30 

Leu He Val Leu Gin He Lys Lys Asp Leu Leu Lys Leu 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 416: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416: 

Met Thr Asp Arg Tyr Asp Ser Ser Leu Gly lie Tyr Lys Val Pro Trp 

15 10 15 

Cys Val Ser Gin Gin Gin Thr Val Thr Glu lie Met Asp Thr Tyr Cys 

20 25 30 

Asp Trp Gly Val Lys Tyr Pro Leu Val Tyr Leu Glu Asp Pro Phe Ser 

35 40 45 

Asp Glu Asp Leu Asp Ser Trp Arg Lys Phe Gin Leu lie Lys Pro Leu 

50 55 60 

Lys Leu Gin Val Phe Gly Asp Asp Phe Tyr Ala Thr Asn Leu Glu Arg 
65 70 75 80 

lie Ser Gin Phe Lys Asp Cys Ala Asp Gly lie Val lie Lys Pro Asn 

85 90 95 

Gin Val Gly Ser Val Ser Lys Thr Leu Glu Val Met Glu Tyr Ala Glu 

100 105 110 

Lys Ser Gly He Ser Met Ala Phe Ser Gin Arg Thr Ala Glu Thr Glu 

115 120 125 

Asn Asn He He Ser His Leu Ala Met Ser Val He Leu Leu He 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 4 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 

Met Lys Lys He Ser Lys Gin Glu Leu Val Asn Thr Arg Phe Ser Arg 

15 10 15 

Leu Phe Ala His Phe Gly Gin Glu Lys Asp Gly Ser Phe Leu Ala Gin 

20 25 30 

Arg Tyr Gin Phe Tyr Leu Ala Gin Gin Gly Gin Thr Leu Ser Gly Ala 

35 40 45 

His Asp Leu Leu Asp Ser Leu lie Glu Arg Asp Tyr Asn Leu Tyr Ala 

50 55 60 

Ala Thr Asn Gly He Thr Ala He Gin Thr Gly Arg Leu Ala Gin Ser 
65 70 75 80 
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Gly Leu Ala Pro Tyr Phe Asn Gin Val Phe He Ser Glu Gin Leu Gin 

85 90 95 

Thr Gin Lys Pro Asp Ala Leu Phe Tyr Glu Lys He Gly Gin Gin He 

100 105 110 
Ala Gly Phe Ser Lys Glu Lys Thr 
115 120 

(2) INFORMATION FOR SEQ ID NO: 418: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 18: 

Met His Arg Lys Thr Val He Asp Xaa Arg Thr Leu Gly Glu Arg Tyr 

15 10 15 

Thr Phe Thr Xaa Pro He Lys Glu Leu Lys Thr Arg Asn Val Ala Glu 

20 25 30 

Val Ala Asp Leu Leu Ala Xaa Val Glu Ser Tyr Gin Glu Gin Asp Tyr 

35 40 45 

Tyr Val Val Gly Tyr Val Ser Tyr Glu Ala Ala Pro Ala Phe Glu Glu 

50 55 60 

Lys Leu Ala Val His Lys Val Pro Leu Leu Gly Glu Tyr Leu Leu Tyr 
65 70 75 80 

Phe Thr Val His Asp Arg Val Glu Thr Ser Pro He Pro Leu Thr Tyr 

85 90 95 

Glu Asp He Asp Leu Pro Ser Asn Trp Gin Gly Xaa Asn Val Cys Thr 
100 105 HO 

Glu Leu 



(2) INFORMATION FOR SEQ ID NO: 419: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419: 

Met Glu Arg He Glu Val Arg Ser Arg Ser Gly Ser Ala His Leu Gly 

1 5 10 15 

His Val Phe Thr Asp Gly Pro Arg Glu Leu Gly Gly Leu Arg Tyr Cys 

20 25 30 

He Asn Ser Ala Ser Leu Arg Phe Val Ala Lys Asp Glu Met Glu Lys 

35 40 45 

Ala Gly Tyr Gly Tyr Leu Leu Pro Tyr Leu Asn Lys 

50 55 60 



(2) INFORMATION FOR SEQ ID NO: 420: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420: 



Met Glu Trp He Arg Leu He Gly 

1 5 
He Leu Lys Phe Asp Thr lie Ala 
20 

Thr Ala Leu Val Ser Gly Val Ser 

35 40 
Gly Lys Glu Phe Ser Asn Gin Arg 

50 55 
Leu Pro Leu Val Gly Leu Ser Glu 
65 70 
Ser He 



He Ala He He Val Val Gly Phe 

10 15 
Thr Val Val Leu Ala Gly Leu Val 
25 30 
Leu Val Glu Phe Leu Glu He Leu 
45 

Val Leu Thr He Phe Met Val Thr 
60 

Thr Phe Gly Leu Asn Asn Asp Gin 
75 80 



(2) INFORMATION FOR SEQ ID NO: 421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 

Met Leu Leu lie Phe Ala Val lie Gly Val Leu Val Ala Leu lie Ala 

15 10 15 

Gin Phe Tyr Ser Ala Lys Ala Ala Val Gly Phe Ala Lys Glu Leu Thr 

20 25 30 

Asn Asp Leu Tyr Arg His lie Leu Ser Leu Pro Lys Asp Ser Arg Asp 

35 40 45 

Arg Leu Thr Thr Ser Ser Leu Val Thr Arg Leu Thr Ser Asp Thr Tyr 

50 55 60 

Gin lie Gin Thr Gly lie Asn Gin Phe Leu Arg Leu Phe Leu Arg Ala 
65 70 75 80 

Pro He He Val Phe Gly Ala He Phe Met Ala Tyr Arg He Ser Ala 

85 90 95 

Glu Leu Thr Phe Trp Phe Leu Val Met Val Ala He Leu Thr He Val 

100 105 110 

He Val Arg Val He Ser lie Gly Gin Ser Ser Leu Gin 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 422: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 

Met Pro He Glu Gin Val Gin Lys Leu Leu Gly His Ser Lys He Asp 

15 10 15 

Thr Thr Leu Ala Tyr Ala Met Val Asn Gin Asn Asn Val Lys His Ser 

20 25 30 

His Gin Lys Phe He Ser 
35 

i2) INFORMATION FOR SEQ ID NO:423: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423: 



Met Lys Pro Cys Trp Lys Lys Cys 

1 5 
Gly Glu Val Thr Leu lie Glu lie 
20 

Lys Gin Val His Glu Leu Asn Leu 

35 40 
Gin Val His Asn Gly Lys Ser Gin 

50 55 
Tyr Leu Gly Asp Met He His Leu 
65 70 
Lys Val Lys Asp Leu Leu Leu 
85 



Phe Pro Glu Glu Val Ser Ser Glu 

10 15 
Pro Val Ser Asp Lys He Ala Gly 
25 30 
Pro His Asn Val Leu He Thr Thr 
45 

Thr Val Asn Gly Ser Thr Arg Met 
60 

Val He Pro Lys Ser Glu He Gly 
75 80 



(2) INFORMATION FOR SEQ ID NO: 424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 



Met Asn Glu Gly Val Glu Asn Phe 

1 5 
Ala Lys Asn lie Gly He Tyr Val 
20 

Ser He Asp Thr Gly Lys Phe Thr 

35 40 
Ser Asp Ser Gly Phe Leu Glu Ser 
50 55 



Arg Ala Lys Leu Ala Ser Leu Gly 

10 15 
Gly Val Tyr Phe Met Glu Glu His 
25 30 
Ser Val Trp He Pro Ser Tyr Gly 
45 

Ser Pro Lys Thr Asp Leu Asp Tyr 
60 
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Asp He His Gin Tyr Thr Ser Lys Gly Lys Leu Pro Ala Leu Thr Thr 
65 70 75 80 

He 



(2) INFORMATION FOR SEQ ID NO: 425: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:425: 



Met Val Ser Lys Tyr Leu Leu Leu 

1 5 
Thr Trp He Leu Arg Met He Ser 
20 

Leu Pro Ala He Val Glu Arg Phe 

35 40 
He Phe Ala Leu He Leu Ser Ser 

50 55 
Leu Pro Gin He Lys Trp Leu Asp 
65 70 
Trp Val Ala Phe Arg Tyr Arg Asn 
85 

Val Val Leu He Ala He Leu Arg 
100 



Ala Val He Phe Ser Gly Leu Val 

10 15 
Phe He Leu Val Lys Tyr Lys Gly 
25 30 
Leu Lys Phe Leu Pro Val Ser He 
45 

Val Val Thr Gly Lys Val Gly Ser 
60 

Phe Leu Ala Val Phe Pro Thr Ala 

75 80 
Leu Leu Gly Thr Val Leu Phe Gly 

90 95 
Leu Val Ser 
105 



(2) INFORMATION FOR SEQ ID NO: 426 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426: 
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Met Glu Asp Xaa Glu Thr Gin Arg Val Val Met Gin Tyr Arg Ala Pro 

1 5 10 15 

Glu Asn Asn Arg Trp Ser Gly Tyr Ala Phe Pro Gly Gly His Val Glu 

20 25 30 

Asn Asp Glu Ala Phe Ala Glu Ser Val lie Arg Glu He Tyr Glu Glu 

35 40 45 

Thr Gly Leu Thr He Gin Asn Pro Gin Leu Val Gly He Lys Asn Trp 

50 55 60 

Pro Leu Asp Thr Gly Gly Arg Tyr He Val He Cys Tyr Lys Ala Thr 
65 70 75 80 

Glu Phe Ser Gly Thr Leu Gin Ser Ser Glu Glu Gly Glu Val Ser Trp 

85 90 95 

Val Gin Lys Asp Gin He Pro Asn Leu Asn Leu Ala Tyr Asp Met Leu 

100 105 110 

Pro Leu Met Glu Met Met Glu Ala Pro Asp Lys Ser Glu Phe Phe Tyr 

115 120 125 

Pro Arg Arg Thr Glu Asp Asp Trp Glu Lys Lys He Phe 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 427: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 



Met Thr lie Glu Leu Leu Thr Pro 

1 5 
Glu He Lys Glu Lys Lys Arg Lys 
20 

Phe Asn Pro Val His Asn Ala His 

35 40 
Gin Gin Leu Gly Leu Asp Gin Val 

50 55 
Pro His Val 
65 



Phe Thr Lys Val Glu Leu Glu Pro 

10 15 
Gin Val Gly lie Leu Gly Gly Asn 
25 30 
Leu He Val Ala Asp Gin Val Arg 
45 

Leu Leu Met Pro Glu Tyr Gin Pro 
60 



(2) INFORMATION FOR SEQ ID NO: 428: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:428: 



Met Lys Lys Leu Val Phe Val Cys 

1 5 
Met Ala Glu Phe Val Met Lys Ser 
20 

Ser Arg Ala Thr Ser Ser Trp Glu 

35 40 
Thr Gin Gly lie Phe Gin Glu Tyr 

50 55 
Thr Ser Leu Gin lie Ser Lys Glu 
65 70 
lie Gly Met Asp Ala Ser Asn Val 
85 



Leu Gly Asn lie Cys Arg Ser Pro 

10 15 
Met Thr Asp Asn Tyr Glu lie Gin 
25 30 
His Gly Asn Pro lie His Lys Gly 
45 

Glu lie Pro Tyr Asp Lys Asn Lys 
60 

Asp Phe Glu Ala Phe Asp Tyr lie 

75 80 
Pro Thr Tyr Val Arg Cys Val Gin 
90 95 



(2) INFORMATION FOR SEQ ID NO: 429: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429: 



Met Val Lys Tyr Gly Val Val Gly 

1 5 
Ala Arg Tyr Met Gin Lys Asn Asp 
20 

Asp Pro Asp Asn Ala Glu Ala lie 

35 40 
Ser Lys Phe Leu Arg 
50 



Ala Gly Tyr Phe Gly Ala Glu Leu 

10 15 
Gly Ala Xaa lie Thr Leu Leu Tyr 
25 30 
Ala Glu Glu Leu Gly Ala Lys Val 
45 
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(2) INFORMATION FOR SEQ ID NO: 430: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:430: 



Met Leu Trp Lys Tyr Tyr Asn Ser 

1 5 
Leu Thr Thr Glu Tyr Lys Arg Leu 
20 

Lys Gin Ala Lys Pro lie Thr Met 

35 40 
Glu Gly Asn Pro Phe Val Thr Ala 

50 55 
Thr Gin Cys Glu Val Xaa Leu Asn 
65 70 



Ser Phe Leu Lys Ala Val Pro His 

10 15 
Ala Gin Ala His Gly Leu Asn Leu 
25 30 
Gly Met Trp He Gly Gly Asp Arg 
45 

Lys Asn Leu Lys Gin Ser Ala Leu 
60 

Leu Leu 



(2) INFORMATION FOR SEQ ID NO: 431: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:431: 

Met Asn Thr Leu Tyr Leu Asn Gin His Ser Ser Tyr Lys Asn Asp Glu 

15 10 15 

Asp Ser His Ser Phe Pro He Gin Met Glu Leu Val Ser Asp Glu Met 

20 25 30 

He Pro Arg Asn 
35 



(2) INFORMATION FOR SEQ ID NO: 432: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432: 



Met Ala Arg Lys Glu Thr Met Ala 

1 5 
Lys Asp lie His Pro Glu Tyr Arg 
20 

Thr Gly Tyr Gin Phe Leu Ser Gly 

35 40 
Val Glu Phe Glu Gly Glu Thr Tyr 

50 55 
Ser Asp Ser His Pro Phe Tyr Thr 
65 70 
Asp Gly Arg Val Asp Arg Phe Asn 
85 



Glu Arg Lys Tyr Cys Lys Met Lys 

10 15 
Pro Val Val Phe Met Asp Thr Thr 
25 30 
Ser Thr Lys Arg Ser Asn Glu Thr 
45 

Pro Leu lie Arg Val Glu lie Ser 
60 

Gly Arg Gin Lys Phe Thr Gin Ala 

75 80 
Lys Lys Tyr Gly Leu Lys 
90 



(2) INFORMATION FOR SEQ ID NO: 433: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 



Met Glu Thr Ala Leu lie Ser Val 

1 5 
Gin Tyr Leu Glu Lys Ser lie Ala 
20 

Asn Leu Glu lie lie Leu Val Asp 

35 40 
Arg Leu Cys Asp Ser He Ala Glu 
50 55 



He Val Pro Val Tyr Asn Val Ala 

10 15 
Ser He Gin Lys Gin Thr Tyr Gin 
25 30 
Asp Gly Ala Thr Asp Glu Ser Gly 
45 

Gin Asp Asp Arg Val Ser Val Leu 
60 
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His Lys Lys Asn Glu Gly Leu Ser Gin Ala Arg Asn Asp Gly Met Lys 
65 70 75 80 

Pro Gly Ser Arg Gly Leu Ser Asp Phe Tyr 
85 90 



(2) INFORMATION FOR SEQ ID NO: 434: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434: 



Met Tyr Phe Leu lie Asn Phe Val 

1 5 
Leu Pro Phe Leu lie Asn Thr Gly 
20 

Ser Tyr He Ser Leu Leu Val Phe 

35 40 
Phe Leu Asn His Val Leu Ala Leu 
50 55 



Tyr Pro Val Asp Met Val He Asn 

10 15 
Leu He Val Leu Leu Ser Ala He 
25 30 
Thr Lys Asp Ser He Phe Tyr Glu 
45 

Lys Asn Lys Phe Lys Lys Ser 
60 



(2) INFORMATION FOR SEQ ID NO: 435: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435: 

Met Arg Val He Ala Trp Leu Glu Pro Ser Arg Lys He Leu Gin Pro 

15 10 15 

Gly Ala Asn Asp Val Trp Val Val Lys Arg Lys Gly Lys Arg Asp Leu 

20 25 30 

Leu Leu Pro Tyr He Pro Pro Val Val Leu Asn Val Asp He Pro Asn 

35 40 45 
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Lys Arg Val Asp Val Glu lie Leu Glu Gly Leu Asp Asp Glu Asp 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 436: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 6: 

Met Asp Asp Asp Phe Lys Arg Tyr Asn Ala Pro He Leu Thr Trp Tyr 

15 10 15 

Glu Thr Ala Arg Tyr Ala Phe Glu Arg Gly Met Val Trp Gin Asn Leu 

20 25 30 

Gly Gly Val Glu Asn Ser Leu Asn Gly Gly Leu Tyr His Phe Lys Glu 

35 40 45 

Lys Phe Asn Pro Thr He Glu Glu Tyr Leu Gly Glu Phe Thr Met Pro 

50 55 60 

Thr His Pro Leu Tyr Pro Leu Leu Arg Leu Ala Leu Asp Phe Arg Lys 
65 70 75 80 

Thr Leu Arg Lys Lys His Arg Lys 
85 

(2) INFORMATION FOR SEQ ID NO: 437: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 437: 

Met Glu Asp Phe Phe Ala Trp Cys Arg Arg Gin Ser Val Leu Ser Gly 

15 10 15 

Ser Lys Leu Gly Arg Ala He Glu Tyr Ser Leu Lys Tyr Glu Glu Thr 
20 25 30 
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Phe Lys Thr lie Leu Lys Asp Gly His Leu Val Leu Ser Asn Asn Leu 

35 40 45 

Ala Glu Arg Ala lie Lys Ser Leu Val Met Gly Arg Ser Lys Arg Val 

50 55 60 

Gin Trp Thr Leu Leu Ala 
65 70 



(2) INFORMATION FOR SEQ ID NO: 43 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(CJ STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 



Met Thr Lys Gin Gly Ser Trp Lys 

1 5 
Gly Arg Glu Leu Phe Pro lie Leu 
20 

Phe Asp Gin Gin Ala Pro Xaa lie 
35 40 



Leu lie Leu Glu Asp Glu Ser Ala 

10 15 
Thr Gin Gly Xaa Tyr lie Ala Thr 
25 30 
Asp Glu He Phe Lys Leu 
45 



(2) INFORMATION FOR SEQ ID NO: 439: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 



Met Thr Thr Asn Xaa Asp Asn Asp 

1 5 
Arg Lys Arg Thr Ala Gly Glu Thr 
20 

Thr His Gly Gly Asn Tyr Thr He 
35 40 



Thr Ser Glu Arg Ser Asp Cys Tyr 

10 15 
Pro Met Asn Asp Leu Thr Ser His 
25 30 
Ala Arg Tyr Gin Glu Glu Lys Phe 
45 
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Trp Asn Lys Gin Leu 
50 
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(2) INFORMATION FOR SEQ ID NO: 440: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xii SEQUENCE DESCRIPTION: SEQ ID NO: 440: 

Met Ala Gin His Ala Ala Val Glu Ala Leu Thr Ala Gly Lys Asn Asp 

15 10 15 

Ala Glu Pro Met Lys Lys Glu Tyr lie Gin Arg Arg Asp Tyr lie lie 

20 25 30 

Glu Lys Met Thr Ala Leu Gly Phe Glu lie lie Lys Pro Asp Gly Ala 

35 40 45 

Phe Tyr He Phe Ala Lys He Pro Ala Gly Tyr Asn Gin Asp Ser Phe 

50 55 60 

Ala Phe Leu Lys Asp Phe Ala Gin Lys Lys Ala Val Ala Phe He Pro 
65 70 75 80 

Gly Ala Ala Phe Gly Arg Tyr Gly Glu Gly Tyr Val Arg Leu Ser Tyr 

85 90 95 

Ala Ala Ser Met Glu Thr He Lys Glu Ala Met Lys Arg Leu Glu Glu 

100 105 HO 

Tyr Met Arg Glu Ala 
115 

(2) INFORMATION FOR SEQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 441: 



323 



WO 97/37026 PCIYUS97/05306 

Met Asn lie Thr Asn Leu Phe Ser lie Lys Thr Gly Cys Asp Glu Thr 

15 10 is 

Asp Arg Gin Leu Gin Lys Leu Phe Phe Gin Leu Asp Leu Gin Leu Gly 

20 25 30 

Glu Leu Thr Asp Gin Leu Arg Lys Leu Asp Ser Asn Phe Val Pro Arg 

35 40 45 

Ser Gin Phe Val Asp Thr Leu Asp Leu Asn Asp Val Glu Tyr Lys Glu 

50 55 60 

He Leu Asn Tyr Phe He Phe His Arg Asn Asp Ser Glu Glu Ser Leu 
65 70 75 80 

Val Glu Trp Leu Tyr Asp Trp He Ser Thr Asn Arg Tyr Glu Leu Pro 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 442 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

{C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442: 

Met Leu Gin Trp He Asp Arg Thr Phe Gin Asp Tyr Leu Glu Leu Glu 

15 10 15 

His Leu Val Leu Thr Thr Trp Ser Gly Asn Leu Gly Met Met Lys Leu 

20 25 30 

Ala Glu Lys Leu Xaa Met Lys Lys Xaa Ala His He Xaa Lys Val Arg 

35 40 45 

Tyr Tyr Gin Gly Lys Tyr Phe Asp Ser He Lys Tyr Gly lie Leu Arg 

50 55 60 

Xaa Asp Trp Glu Lys He Asn Asp Gly Tyr Tyr Gin He Tyr Xaa Asn 
65 70 75 80 

Ser 



£2) INFORMATION FOR SEQ ID NO: 443: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443: 

Met He Val Glu Glu Val Arg Asp Arg Tyr Val Gly Lys Val Asp Ala 

15 10 15 

Val Phe His Asn Gly Asp Ser Glu Leu Arg Pro Asp Ser Pro Leu Trp 

20 25 30 

Glu Gly He Arg Val Val Lys Gly Asn Met Asp Phe Tyr Ala Gly Tyr 

35 40 45 

Pro Glu Arg Leu Val Thr Glu Leu Gly Ser Thr Lys He He Gin Thr 

50 55 60 

His Gly His Leu Phe Asp He Asn Phe Asn Phe Gin Lys Leu Asp Tyr 
65 70 75 80 

Trp Ala Gin Glu Glu Glu Ala Ala He Cys Leu Tyr Gly His Leu His 

85 90 95 

Val Pro Ser Ala Trp Leu Glu Gly Lys He Leu Phe Leu Asn Pro Gly 

100 105 no 

Ser He Ser Gin Pro Arg Gly Thr He Arg Glu Cys Leu Tyr Ala Arg 

115 120 125 

Val Glu He Asp Asp Ser Tyr Phe Lys Val Asp Phe Leu Thr Arg Asp 

130 135 140 

His Glu Val Tyr Pro Gly Cys Pro Arg Ser Leu Ala Asp Asp Ala Lys 
145 150 155 160 

Glu Leu Arg Leu Ser Val Gly Ala Glu Asn Phe 
165 170 

(2) INFORMATION FOR SEQ ID NO: 444: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: 3EQ ID NO: 444: 

Met Asn Leu His Gin Pro Leu His Val Leu Pro Gly Val Gly Pro Lys 
15 10 15 
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Ser Ala Glu Lys Tyr Ala Lys Leu Gly He Glu Asn Leu Gin Asp Leu 

20 25 30 

Leu Leu Tyr Phe Pro Phe Arg Tyr Glu Asp Phe Lys Thr Lys Gin Val 

35 40 45 

Leu Glu Leu Glu Asp Gly Glu Lys Ala Val Leu Ser Gly Gin Val Val 

50 55 60 

Thr Pro Ala Ser Val Gin Tyr Tyr Gly Phe Gin Ala Ala Asn Pro Pro 



(2) INFORMATION FOR SEQ ID NO: 445: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445: 

Met Gin Leu Leu Ser Glu Asn Met Leu Lys Thr He Gin Ser Leu Ser 

15 10 15 

Val Trp Gin He Tyr Leu Leu Gly Phe Glu Arg He Leu Ala Leu Gly 

20 25 30 

Phe Gin Leu Leu Leu Thr Val Trp Val Tyr Gin Ala Val Arg Gin Lys 

35 40 45 

Lys Trp He Tyr Leu Leu Ala Ala Tyr Gly Leu His Ala Phe Phe Asp 

50 55 60 

Leu Ala Pro Ser Leu Phe Gin Val Gly Trp Leu Thr Asn Pro Val Leu 
65 70 75 80 

Val Glu Val He Leu Ala Leu Glu Leu Val Leu Val Ala Tyr Gly Thr 

85 90 95 

Lys Glu He Phe Cys Lys Lys Ser 



65 70 75 

Ala Phe Leu Val Leu Gin Ala Arg Glu Glu Gly 



Ser Phe Phe Phe Gly 
95 



80 



85 90 

Gly 



100 



(2) INFORMATION FOR SEQ ID NO: 446: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 122 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446: 

Met Thr Lys Ser Met Thr Pro Asp Arg Glu Val lie Thr Phe He Pro 

15 10 15 

Glu Lys Phe He Val Asp Gly Phe Gin Gly He Arg Asp Pro Arg Gly 

20 25 30 

Met Met Gly Val Arg Leu Glu Met Arg Gly Leu Leu Tyr Thr Gly Pro 

35 40 45 

Arg Thr He Leu His Asn Leu Arg Lys Thr Val Glu Arg Ala Gly Val 

50 55 60 

Gin Val Glu Asn Val He He Ser Pro Leu Ala Met Val Gin Ser Val 
65 70 75 80 

Leu Asn Glu Gly Glu Arg Glu Phe Gly Ala Thr Val He Asp Met Gly 

85 90 95 

Ala Gly Gin Thr Thr Val Ala Thr He Arg Asn Gin Glu Leu Gin Phe 

100 105 HO 

Thr His He Leu Gin Glu Val Glu He Met 
115 120 

(2) INFORMATION FOR SEQ ID NO:447: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447: 

Met Leu Asn Ser He Val Thr He He Cys He Ala Leu He Ala Phe 

1 5 10 15 

He Leu Phe Trp Phe Phe Lys Lys Pro Glu Lys Ser Gly Gin Lys Ala 

20 25 30 

Gin Gin Lys Asn Gly Tyr Gin Glu He Aj:g Val Glu Val Met Gly Gly 
35 40 45 
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Tyr Thr Pro Glu Leu lie Val Leu Lys Lys Ser Val Pro Ala Arg lie 

50 55 60 

Val Phe Asp Arg Lys Asp Pro Ser Pro Cys Leu Asp Gin He Val Phe 
65 70 75 80 

Pro Asp Phe Gly Val His Ala Asn Leu Pro Met Gly Glu Glu Tyr Val 

85 90 95 

Val Glu He Thr Pro Glu Gin Ala Gly Glu Phe Gly Phe Ala Cys Gly 

100 105 HO 

Met Asn Met Met His Gly Lys Met He Val Glu 



(2) INFORMATION FOR SEQ ID NO: 448: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:448: 

Met Lys He Val Ser Gly He Tyr Gly Gly Arg Pro Leu Lys Thr Leu 

15 10 15 

Glu Gly Lys Thr Thr Arg Pro Thr Ser Asp Lys Val Arg Gly Ala He 

20 25 30 

Phe Asn Met lie Gly Pro Tyr Phe Glu Val Gly Xaa Val Leu Asp Leu 

35 40 45 

Tyr Ala Gly Ser Gly Gly Leu Ser He Glu Ala Val Ser Arg Gly Met 

50 55 60 

Ser Ser Ala Val Leu Val Glu Arg Asp Arg Lys Ala Gin Thr lie Val 
65 70 75 80 

Ala Glu Asn He Gin Met Thr Lys Glu Val Gly Lys Phe Gin Leu Leu 

85 90 95 

Gin Asn Gly Cys Arg Lys Gly lie Gly Thr Gly Xaa Leu Gly Asn Leu 

100 105 110 

Thr Ser Phe Ser 



115 



120 



115 



(2) INFORMATION FOR SEQ ID 



NO:449: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 52 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 

Met xaa Thr Asn Phe Thr Asp Gin Val Asp Thr Met He Tyr val Asp 

Lys Glu Glu Lys Glu Thr He Lys Ala Ala Leu Val Glu Phe Phe Asn 

20 25 3 0 

Gly Lys Val Thr Leu Thr Asp Gin Gly Leu Arg Glu Val Glu Val Pro 

35 40 45 

Val Asn Leu Val 
50 

(2) INFORMATION FOR SEQ ID NO: 450: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 

Met He Gly Val Ala Leu Glu Glu Ser Pro Thr Leu Lys He His His 

Leu Thr His Asp He Thr Pro Tyr Asn lie Phe Glu Gly Ser Tyr Arg 

20 25 30 

Leu Phe Gin Thr Val Asp Tyr Trp Pro Glu Gly Thr Thr Phe Val Ser 

35 40 45 

Val Val Asp Pro Gly Val Gly Ser Lys Arg Lys Ser Val Val Ala Lys 

50 55 60 

Thr Ala Gin Lys Ser He His Cys His Ala Arg 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 451: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 130 amino acids 
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<B) TYPE : amino acid 

tC) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451: 

Met Glu Leu Arg Arg Pro Arg Leu Ala Asp Lys Lys Ala Val Leu Asp 

15 10 15 

Met Met Thr Glu Phe Glu Lys Ser Gin Ser Ala His Asp Gly Gly Phe 

20 25 30 

Trp Asp Thr Glu Asn Phe Val Tyr Glu Glu Trp Leu Glu Ser Asn Gin 

35 40 45 

Glu Gin Glu Met Gly lie Asn Leu Pro Glu Gly Trp Val Pro Ala He 

50 55 60 

Gin Leu Val Ala Phe Ser Glu Lys Gly Gin Ala Val Gly Phe Leu Asn 
65 70 75 80 

Leu Arg Leu Arg Leu Ser Asn Phe Leu Leu Glu Glu Gly Gly His He 

85 90 95 

Gly Tyr Ser He Arg Pro Ser Glu Arg Gly Lys Gly Tyr Ala Lys Glu 

100 105 HO 

Leu Ser Val Arg Ala Cys Lys Leu Leu Arg Lys Arg Thr Ser Arg Lys 
115 120 125 

Leu Trp 
130 

(2) INFORMATION FOR SEQ ID NO: 452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452: 

Met Thr Phe He Arg Gly Pro lie He Ser Thr Val Gly Glu Gly Vai 

1 5 10 15 

Xaa He Leu Ala Thr Val Asn Xaa Gin Phe Val Xaa Xaa Gin Xaa Xaa 

20 25 30 
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Asn Met Leu Val Ser Pro Phe His Pro Glu Leu Thr Asp Xaa Xaa Arg 

35 40 45 

Leu Xaa Gin Tyr Phe lie Ser Met Cys Lys Glu Lys Ser Xaa Asp 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453: 

Met Lys Gly Asp Val He Thr Glu His Gin Phe Tyr Glu Gin Val Lys 

15 10 15 

Asn Asn Pro Ser Ala Gin Xaa Val Leu Leu Asn Met Thr He Gin Lys 

20 25 30 

Val Phe Glu Lys Thr He Trp Leu Arg Ala 
35 40 

(2) INFORMATION FOR SEQ ID NO: 454: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454: 

Met Asp Lys Ala Leu Ala Asp Leu Lys Thr Ser Gly His Leu Pro He 

1 5 10 15 

Pro Arg His Leu Arg Asp Gly His Tyr Xaa Gly Ser Lys Glu Leu Gly 

20 25 30 

Asn Ala Gin Asp Tyr Leu Tyr Pro His Asn Tyr Pro Xaa Asn Trp Val 

35 40 45 

Lys Gin Asp Tyr Leu Pro Gin Lys He Arg Asn His His Tyr Phe Gin 
50 55 60 
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Ala Glu Tyr Thr Gly Lys Tyr Glu Arg Ala Leu Ala Gin Arg Lys Glu 

65 70 75 80 

Ala lie Asp His Leu Arg Lys lie 
85 



(2) INFORMATION FOR SEQ ID NO: 455: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 

Met Val Val Phe Thr Gly Ser Thr Val Glu Glu Ala He Gin Lys Gly 

1 5 10 15 

Leu Lys Glu Leu Asp He Pro Arg Met Lys Ala His He Lys Val He 

20 25 30 

Ser Arg Glu Lys Lys Gly Phe Leu Gly Leu Phe Gly Lys Lys Pro Ala 

35 40 45 

Gin Val Asp He Glu Ala He Ser Glu Thr Thr Val Val Lys Ala Asn 

50 55 60 

Gin Gin Val Val Lys Gly Val Pro Lys Lys He Asn Asp Leu Asn Glu 
65 70 75 80 

Pro Val Lys Thr Val Ser Val Arg Asn Arg 
85 90 



(2) INFORMATION FOR SEQ ID NO:456: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:456: 

Met Thr Lys Arg Cys Ser Trp Val Lys Met Thr Asn Pro Leu Tyr I 
15 10 i5 
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Ala Tyr His Asp Glu 
20 

Leu Phe Glu Leu Leu 
35 

Glu Thr Val Leu Asn 
50 



Glu Trp Gly Gin Pro Leu 
25 

Cys Met Glu Thr Tyr Gin 
40 

Lys Arg Gin Ala Phe Arg 
55 
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His Asp Asp Gin Ala 
30 

Ala Gly Leu Ser Trp 
45 

Glu Ala Phe Pro 
60 



(2) INFORMATION FOR SEQ ID NO: 457: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:457: 



Met Phe Val Gly Gin Glu Trp Thr 

1 5 
Gly Asn His Gin Gly Gin Val Thr 
20 

Phe Pro Val Ser Ala Arg Ser Val 
35 40 



Asn Gin Thr Phe Val Asp Leu Leu 

10 15 
He Asp Glu Glu Gly Tyr Gly Gin 
25 30 
Ser Val Trp Ala Val Asn Thr He 
45 



(2) INFORMATION FOR SEQ ID NO: 458: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458: 



Met Ala Gin Val Asp He He His Gly lie Gly Thr Gly Val He Arg 

15 10 15 

Glu Gly Val Thr Lys Tyr Leu Gin Arg Asn Lys His Val Lys Ser Phe 

20 25 30 

Gly Tyr Ala Pro Gin Asn Ala Gly Gly Ser Gly Ala Thr lie Val Thr 
35 40 45 
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(2) INFORMATION FOR SEQ ID NO:459: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459: 

Met Asn Leu Leu Glu Glu Ala Thr Lys Gly Lys Val lie Phe Glu Gly 

15 10 15 

Val Asp lie Thr Asp Lys Lys Asn Asp Leu Phe Pro Met Arg Glu Lys 

20 25 30 

Met Gly Met Val Phe Gin Gin Phe Asn Xaa Phe Leu lie 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 460: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 60: 

Met Gin He Asp Leu Ala Asn Leu Leu Pro Asp Leu Pro Gly Asn Leu 

15 10 15 

Ser Gly He Asn Pro Asn Arg Tyr Val Phe Tyr Gin Asp Val Leu Cys 

20 25 30 

Pro He Leu Asp Arg His Met Thr Pro Glu Gin Asp Lys Pro His Phe 

35 40 45 

Ala Gin Ala Ala Gly Thr He Ala Asp lie Lys Glu Lys Ala Gly Asn 

50 55 60 

Tyr Ala Tyr Leu Phe Glu Thr Gin Ala Gin Leu Asn Ala He Leu Ser 
65 70 75 80 
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Ser Lys Val Asp Val Gly Arg Arg lie Arg His Ala Tyr Gin Ala Asp 

85 90 95 

Asp Lys Glu Ser Leu Gin Gin lie Ala Arg Gin Glu Leu Pro Glu Leu 

100 105 HQ. 

Arg Ser Gin lie Glu Asp Phe His Ala Leu Phe Ser His His Trp Leu 
115 120 125 

Lys 



(2) INFORMATION FOR SEQ ID NO:461: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461: 

Met Arg Gin Gly lie Asp Ser Leu Ala Tyr Val Val Lys Thr His Phe 

15 10 15 

Glu Leu Asp Pro Phe Ser Gly Gin Ala Phe Leu Phe Cys Gly Gly Arg 

20 25 30 

Lys Asp Arg Phe Lys Ala Leu Tyr Trp Asp Gly Gin Gly Phe Trp Leu 

35 40 45 

Leu Tyr Lys Arg Phe Glu Asn Gly Lys Leu Thr 
50 55 

(2) INFORMATION FOR SEQ ID NO: 462: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462: 

Met Asn Ala Phe Leu Glu Glu Leu Ser Gin Ala Tyr Pro Asp Asp Tyr 
15 10 15 
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Phe Leu Leu Val Met Asp Asn Ala 
20 

Lys lie Pro Thr Asn lie Gly Phe 

35 40 
Glu Met Asn Pro lie Glu Gin Val 

50 55 
Phe Lys Asn Lys Ala Phe Arg lie 
65 70 
Gin Asp Val lie Gin Gly Leu Glu 
85 

Asn Arg Arg Trp Thr Arg Met Leu 
100 
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He Trp His Lys Ser Ser Val Leu 
25 30 
Ala Phe He Pro Pro Tyr Thr Pro 
45 

Trp Lys Glu He Arg Lys Arg Gly 
60 

Leu Glu Asp Val Met Asn Gin Leu 

75 80 
Lys Glu Val He Lys Ser He Val . 

90 95 
Phe Glu Ser Arg 
105 



(2) INFORMATION FOR SEQ ID NO: 4 63: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 

Met Thr Val Glu Glu Glu Lys Val Phe Leu Ala Arg His Leu Lys Ala 

1 5 10 15 

Thr Glu Ala Gly Glu Phe Val Thr He Asp Ala Leu Phe Gin Ala Tyr 

20 25 30 

Lys Lys Glu Leu Gly Arg Ser Tyr Thr Arg Asp Ala Phe Tyr Gin Leu 

35 40 45 

Leu Lys Arg His Gly Trp Arg Asn He Met Pro Arg Pro Glu His Pro 

50 55 60 

Lys Lys Ala Asp Ala Gin Thr He Val Ala Ser Lys Asn Lys He Ser 
65 70 75 80 

lie. Gin Glu Asp Lys Lys Ala He 
85 



(2) INFORMATION FOR SEQ ID NO: 464: 

(i) SEQUENCE CHARACTERISTICS: 
£A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464 ; 

Met Leu Ser Leu Val Val Leu lie Thr Gin Tyr Ala Thr His Gly Glu 

15 10 15 

Leu Phe Glu Asn Tyr lie Ser Lys Glu Asn Asp Asn Val He Lys Glu 

20 25 30 

Tyr Gin Asn Thr Thr Gly Phe Ser Thr Pro Tyr Thr Leu Asp Gly Ser 

35 40 45 

Val Leu He Val His Pro Asp Leu Thr Lys Gly Met Asn He Glu Gly 

50 55 60 

Tyr Ser Asp Leu Leu Lys Pro Glu Leu Lys Gly Lys He Ala Thr Ala 
65 70 75 80 

Asp Pro Ala Asn Ser Ser Ser Ala Phe Ala Gin Leu Thr Asn Met Leu 

85 90 95 

Gin Ala Gin Gly Gly Tyr Lys Asp Asp Leu Ala Trp Ser Val Leu Ser 

100 105 110 

Thr He Phe Ser Thr Leu He Asp Gly He Val Lys 



(2) INFORMATION FOR SEQ ID NO: 465: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465: 

Met Leu Ala Arg Lys Asp Arg Phe Tyr Tyr He Asp Lys Ala Thr Gly 

15 10 15 

Lys Ala Leu Gly He Phe Leu Ser Cys Val Leu Thr Ser Asn Gly Val 

20 25 30 

He Glu Val Gly Ala Val Thr Phe Ser Pro Lys Leu Arg Gly Thr Arg 



115 



120 



35 



40 



45 
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lie Gly Thr Glu Ala Gin Tyr Leu Leu Ala Arg Tyr Val Phe Glu Glu 

50 55 60 

Leu Asn Tyr Arg Arg Tyr Glu Trp Lys Cys Asp Ala Leu Asn Leu Pro 
65 70 75 80 

Ser Arg Arg Ala Ala Glu Arg Leu Gly Phe lie Tyr Glu Gly Thr Phe 

85 90 95 

Arg Gin Ala Val Val Tyr Lys Gly Arg Thr Arg Asp Met Asp Trp Leu 

100 105 HO 

Ser Met lie Asp Lys Asp Trp Pro Lys Val Lys Asp Arg Leu Glu Thr 

115 120 125 

Trp Leu Arg Pro Glu Asn Phe Asp Lys Asn Gly Gin Gin Tyr Lys Ser 

130 135 140 

Leu Arg Glu Leu 
145 

(2) INFORMATION FOR SEQ ID NO:466: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 

Met Met Thr Ser Asp Phe Phe Glu Val Trp Phe Gin Lys Phe Leu Leu 

15 10 15 

Pro Thr Leu Thr Thr Pro Ser Val lie lie Val Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 467: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467: 
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Met Val Ser Thr Lys Thr Gin lie Ala Gly Phe Glu Phe Asp Asn Cys 

15 10 15 

Leu Met Asn Ala Ala Gly Val Ala Cys Met Thr lie Glu Glu Leu Glu 

20 25 30 

Glu Val Lys Asn Ser Ala Ala Gly Thr Phe Val Thr Lys Thr Ala Thr 

35 40 45 

Leu Asp Phe Arg Gin Gly Asn Pro Glu Pro Arg Tyr Gin Asp Val Pro 

50 55 60 

Leu Gly Ser lie Asn Ser Met Gly Leu Pro Asn Asn Gly Leu Asp Tyr 
65 70 75 80 

Tyr Leu Asp Tyr Leu Leu Asp Leu Gin Glu Lys Glu Ser Asn Arg Thr 

85 90 95 

Phe Phe Leu Ser Leu Val Gly Met Ser Pro Glu Lys Pro lie Leu Phe 
100 105 110 



(2) INFORMATION FOR SEQ ID NO: 468 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 



Met Thr Tyr Leu Met Phe Ser Gly Leu Asp Glu Asn Phe Tyr His Phe 

15 10 15 

Pro Trp Glu Val Phe Ala Gly Phe Gly lie lie Phe Leu Ala Cys Pro 
20 25 30 

Glu Lys Val 
35 



(2) INFORMATION FOR SEQ ID NO: 469: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469: 



Met Phe Thr Gly Trp Phe Ala Gin 

1 5 
Gly Ala Trp Gly Leu Gly Phe Met 
20 

Lys Glu Leu Leu Glu Gly Glu Tyr 
35 40 



Thr Phe Leu His Gin Phe lie Arg 

10 15 
lie Phe lie Ala Phe Pro Met Gly 
25 30 
His Glu His Asp 



(2) INFORMATION FOR SEQ ID NO: 470: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 



Met Gin Glu Lys lie Leu Val Thr Gly Gly Ala Gly Phe lie Gly Thr 

15 10 15 

His Thr Val lie Glu Leu He Gin Ala Gly His Gin Val Val Val Val 

20 25 30 

Asp Asn Leu Val Asn Ser Asn Xaa Lys Ser Leu Glu Val Val Glu Arg 

35 40 45 

He Thr Gly Val Glu He Pro Phe Tyr Glu Ala Asp He Xaa Asp Thr 

50 55 60 

Asp Thr Leu Arg Asp He Phe Lys Gin Glu Glu Leu Thr Gly Val He 
65 70 75 80 

His Phe Ala Gly Leu Lys Ala Val Gly Glu Ser Thr Arg He Pro Leu 

85 90 95 

Ala Tyr Tyr Asp Asn Asn He Ala Gly Thr Val Ser Leu Leu Lys Ala 

100 105 HO 

Met Glu Glu Asn Asn Cys Lys Asn He He Phe Ser Ser Xaa Ala Thr 

115 120 125 

Val Tyr Gly Asp Pro Tyr Thr Val Pro He Leu Glu Val Ser His Phe 
130 135 140 

Gin 
145 



(2) INFORMATION FOR SEQ ID NO: 471: 



340 



WO 97/37026 PCI7US97/05306 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 200 amino acids 
<B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471: 



Met Val Lys Glu Lys Val Thr Leu Thr Pro Leu Ala Ser Gly Tyr Gin 

15 10 15 

He Gly Glu Glu Glu Phe Glu Gin Val He Leu Ala Xaa Gly Ala Trp 

20 25 30 

Leu Gly Asp Met Leu Glu Pro Leu Gly Tyr Glu Val Asp Val Arg Pro 

35 40 45 

Gin Lys Gly Gin Leu Arg Asp Tyr Gin Leu Val Gin Asp Met Glu Ala 

50 55 60 

Tyr Pro Val Val Met Pro Glu Gly Glu Trp Asp Leu He Pro Phe Ala 
65 70 75 80 

Gly Gly Lys Leu Ser Leu Gly Ala Thr His Glu Asn Asp Met Gly Phe 

85 90 95 

Asp Leu Thr Val Asp Glu Thr Leu Leu Gin Gin Met Glu Glu Ala Thr 

100 105 110 

Leu Thr His Tyr Leu He Leu Ala Glu Ala Thr Ser Lys Ser Glu Arg 

115 120 125 

Val Gly He Arg Ala Tyr Thr Ser Asp Phe Ser Pro Phe Phe Gly Gin 

130 135 140 

Val Pro Asp Leu Thr Gly Val Tyr Ala Ala Ser Gly Leu Gly Ser Ser 
145 150 155 160 

Gly Leu Thr Thr Gly Pro He He Gly Tyr His Leu Ala Gin Leu He 

165 170 175 

Gin Asp Lys Glu Leu Thr Leu Asp Pro Glu Asn Tyr Pro He Glu Asn 

180 185 190 

Tyr Val Lys Arg Val Lys Ser Glu 
195 200 

(2) INFORMATION FOR SEQ ID NO: 472: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472: 



Met Glu His lie He Tyr Gin Phe Glu Glu Asp Leu Ala He Leu Thr 

15 10 15 

Leu Asn Arg Pro Glu Val Ala Asn Gly Phe His He Pro Met Cys Glu 

20 25 30 

Glu He Leu Glu Ala Leu Thr Leu Ala Glu Glu Asn Pro Ala Val His 

35 40 45 

Phe He Leu He Asn Ala Asn Gly Lys Val Phe Ser Val Gly Gly Asp 

50 55 60 

Leu Val Glu Met Lys Arg Ala Val Asp Glu Asp Asp He Pro Ser Leu 
65 70 75 80 

Thr Lys He Ala Glu Leu Val Asn Thr He Ser Tyr Lys He Lys Gin 

85 90 95 

He Ala Lys Pro Val Phe Asn Gly Ser 
100 105 



(2) INFORMATION FOR SEQ ID NO: 473: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473: 



Met Gin Pro Glu Val He Trp Leu 

1 5 
Met Lys Val Leu Asn Thr Thr Asp 
20 

Val Asp He He Gin Asn Ser Leu 

35 40 
Leu He Lys Glu Tyr Gin Glu Asp 

50 55 
Glu Lys Asp He Gin Thr Arg Thr 
05 70 
Asn 



Gin Lys He Ala Lys Pro Ser Gin 

10 15 
Arg Val Gin Ala He Lys Asp Asp 
25 30 
Gin He He Asn Gin Gin Lys Glu 
45 

Leu Thr Tyr Lys Phe Lys Val Leu 
60 

Ser Cys Asp Lys Arg Asn Ala Gly 

73 80 
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(2) INFORMATION FOR SEQ ID NO: 474: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474: 

Met Gly Ser Asn Ser Leu Thr Leu Leu Leu Lys lie Gin Ala Thr lie 

15 10 15 

Pro Arg Asn lie Tyr Asp Asn Leu Gin Ala Gly Ser Lys Val Thr Leu 

20 25 30 

Asp Arg Ala Tyr Gly His Met lie lie Glu Glu Gly Arg Glu Asn Gin 

35 40 45 

Val Trp lie Ala Gly Gly lie Gly He Thr Pro Phe He Ser Tyr He 

50 55 60 

Arg Glu His Pro He Leu Asp Lys Gin Val His Phe Tyr Tyr Ser Phe 
65 70 75 80 

Arg Gly Asp Glu Asn Ala Val Tyr Leu Asp Leu Leu Arg Asn Tyr Ala 

85 90 95 

Gin Lys Asn Pro Asn Phe Glu Leu His Leu He Asp Ser Thr Lys Asp 

100 105 110 

Gly Tyr Leu Asn Phe Glu Gin Lys Lys Cys Pro Asn Met Gin Pro Ser 

115 120 125 

He Cys Val Val Leu Phe Leu 
130 135 

(2) INFORMATION FOR SEQ ID NO: 475: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475: 
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Met Phe Phe Val Leu Ala Tyr Ala Lys Ala Val Ala Gly Arg Asn lie 

1 5 10 15 

Lys Leu His Pro lie Gin Arg Gin Gly Ala Gly Tyr His Ser Val Asn 

20 25 30 

Lys Asp Val Asp Phe Ala Ser Ala Thr Ala Leu Arg Gin His Gin Lys 

35 40 45 

Asp Gin Asp Phe Leu Glu Arg Phe Met Pro Ser Val Ala Leu Phe Glu 

50 55 60 

Gin Ala Ser Lys Val He Trp Glu Asp Tyr Phe Pro Leu Leu Arg Tyr 
65 70 75 80 

Gin He Leu Ser Asn Pro Asp Leu Thr Thr He Tyr Gin Val Asn Gin 

85 90 95 

Glu Met Ala Val Arg He Lys Glu Ala He Lys Thr Ala Gin Ser Val 

100 105 110 

Glu Glu Leu Val Glu Leu Val Thr Thr Lys Arg Tyr Thr Lys Ala Arg 

115 120 125 

Val Arg Arg Leu Leu Thr Tyr He Leu Met Gin Ala Arg Glu Ser Asp 

130 135 140 

Leu Pro Glu Ala He His Val Leu Gly Phe Thr Glu Lys Gly Arg Gin 
145 150 155 160 

His Leu Lys Ser Leu Lys Gly Gin Val Asn Leu Val Ser Arg He Gly 

165 170 175 

Lys Glu Pro Trp Asp Ala Met Thr Gin Lys Val Asp Gin He Tyr Gin 

180 185 190 

Leu Gly Lys Pro Ser He Ala Glu Gin Asn Phe Gly Arg Val Pro He 

195 200 205 

Arg He Glu Thr Asn 
210 

(2) INFORMATION FOR SEQ ID NO: 476: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C} STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO:476: 

Met Leu Gly Ser Met Phe Val Gly Leu Leu Val Gly Phe Leu Ala Gly 
15 10 15 
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Ala Met Thr Asn Arg Gly Glu Arg Met Gly Cys 

20 25 
Leu Gly Trp lie Gly Ala Phe Leu Gly His Leu 

35 40 
Gly Pro Val Leu Ser Gly Thr Ala lie lie Pro 

50 55 
Met He Val Leu Ala He Phe Trp Arg Arg Gly 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 477: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477: 

Met Arg Ser Ser Pro Ser Arg Glu He Gin Pro Arg Ser Trp Gly He 

15 10 15 

Thr Thr Val Tyr Val Thr His Asp Gin Glu Gly Ala Met Ala He Ser 

20 25 30 

Asp Gin He Ala Cys Tyr Glu Arg Trp Gly Asp Pro Thr Asn Arg Pro 

35 40 45 

Thr Lys Arg Thr Val Ser 
50 
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Phe Gly Lys Met Phe 
30 

Leu Phe Gly Thr Trp 
45 

Ala val Leu Gly Ala 
60 
Ser 
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1 . An isolated polynucleotide comprising a polynucleotide sequence selected from 
the group consisting of: 

S (a) a polynucleotide having al least a 70% identity to a polynucleotide encoding a 

polypeptide comprising an amino acid sequence of selected from the group consisting of SEQ ID 
NO:224 through 477, 

(b) a polynucleotide which is complementary to the polynucleotide of (a); 

(c) a polynucleotide having at least a 70% identity to a polynucleotide encoding the 
1 0 same mature polypeptide expressed by the a gene comprising a polynucleotide sequence selected 

from the group consisting of SEQ ID NO: 1 through 223 contained in the DNA of the deposited 
clone; and 

(d) a polynucleotide comprising at least 1 5 sequential bases of the polynucleotide 
of (a), (b)or(c). 

15 2. The polynucleotide of Claim 1 wherein the polynucleotide is DNA. 

3. The polynucleotide of Claim 1 wherein the polynucleotide is RNA. 

4. The polynucleotide of Claim 2 comprising the nucleic acid sequence set forth in 
SEQ ID NO: 1. 

5. The polynucleotide of Claim 2 comprising a nucleotide sequence selected from 
20 the group consisting of SEQ ID NO: 1 through 223. 

6. The polynucleotide of Claim 2 which encodes a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO:224 through 477. 

7. A vector comprising the polynucleotide of Claim 1 . 

8. A host cell comprising the vector of Claim 7. 

25 9. A process for producing a polypeptide comprising: expressing from the host 

cell of Claim 8 a polypeptide encoded by said DNA. 

10. A process for producing a noval polypeptide or fragment comprising 
culturing a host of claim 8 under conditions sufficient for the production of said polypeptide 
or fragment. 

30 11. A polypeptide comprising an amino acid sequence which is at least 70% 

identical to an amino acid sequence selected from the group consisting of SEQ ID NO:224 
through 477. 

12. A polypeptide comprising an amino acid selected from the group consisting of 
SEQ ID NO:224 through 477. 
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13. An antibody against the polypeptide of claim 11. 

1 4. An antagonist which inhibits the activity or expression of the polypeptide of 
claim 11. 

15. A method for the treatment of an individual in need of a Streptococcal 

5 polypeptide comprising: administering to the individual a therapeutically effective amount of the 
polypeptide of claim 1 1 . 

1 6. A method for the treatment of an individual having need to inhibit a bacterial 
polypeptide comprising: administering to the individual a therapeutically effective amount of the 
antagonist of Claim 14. 

0 17. A process for diagnosing a disease related to expression or activity of the 

polypeptide of claim 1 1 in an individual comprising: 

(a) determining a nucleic acid sequence encoding said polypeptide, and/or 

(b) analyzing for the presence or amount of said polypeptide in a sample derived from 
the individual. 

5 1 8. A method for identifying compounds which interact with and inhibit or activate 

an activity of the polypeptide of claim 1 1 comprising: 

contacting a composition comprising the polypeptide with the compound to be screened 

under conditions to permit interaction between the compound and the polypeptide to assess the 

interaction of a compound, such interaction being associated with a second component capable of 
0 providing a detectable signal in response to the interaction of the polypeptide with the 

compound; 

and determining whether the compound interacts with and activates or inhibits an 
activity of the polypetide by detecting the presence or absence of a signal generated from the 
interaction of the compound with the polypeptide, 

5 19. A method for inducing an immunological response in a mammal which 

comprises inoculating the mammal with Streptococcal polypeptide of claim 1 1, or a fragment 
or variant thereof, adequate to produce antibody and/or T cell immune response to protect 
said animal from disease. 

20. A method of inducing immunological response in a mammal which comprises 

) delivering a nucleic acid vector to direct expression of Streptococcal polypeptide of claim 1 1 , 
or fragment or a variant thereof, for expressing said Streptococcal polypeptide, or a fragment 
or a variant thereof in vivo in order to induce an immunological response to produce 
antibody and/ or T cell immune response to protect said animal from disease. 
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21. An isolated Streptococcal polypeptide having one of the amino acid 
sequences of SEQ ID NO:224 through SEQ ID NO:234. 

22. An isolated nucleic acid encoding one of the amino acid sequences of 
Claim 21 and nucleic acid sequences capable of hybridizing therewith under stringent 

5 conditions. 

23. Recombinant vectors comprising the nucleic acid sequences of 
Claim 22 and host cells transformed or transfected therewith. 

24. A method of identifying an antimicrobial compound comprising contacting 
candidate compounds with a polypeptide of Claim 21 and selecting those compounds 

10 capable of inhibiting the bioactivity of said polypeptide. 

25. Antimicrobial compounds identified by the method of Claim 24. 
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